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(54) Data structure for image transmission, image coding method, and image decoding method 

(57) There is provided an image decoding appara- 
tus for decoding a coded image signal having an image 
identifier based on a data structure of the image signal. 
This apparatus comprises a data analyzer for analyzing 
the coded image signal with reference to the image 
identifier to decide whether the coded image signal is a 
coded arbitrary shape signal including both of coded 
shape bit streams obtained by coding a shape signal 
which represents the shape of each object as one of the 
components of a display image and coded pixel value 
bit streams obtained by coding a pixel value signal rep- 
resenting the gradation of the object, or a coded binary 
signal including only coded shape bit streams obtained 
by coding a shape signal representing a display image 
of binary information; a first decoder for decoding the 
coded shape bit streams by a first decoding process; 
and a second decoder for decoding the coded pixel 
value bit streams by a second decoding process. The 
first decoding process and the second decoding proc- 
ess are appropriately switched according to each coded 
image signal. Therefore, plural coded image signals 
having different data structures and produced by differ- 
ent coding methods can be decoded in decoding proc- 
esses corresponding to a single coding method. 
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Description 

FIELD OF THE INVENTION 

The present invention relates to a data structure for 
image transmission, an image coding method, an image 
decoding method, an image coding apparatus, an 
image decoding apparatus, a data storage medium 
which contains a program for implementing an image 
decoding process, and a data storage medium which 
contains a coded image signal. 

More particularly, the invention relates to a data 
structure for image transmission thai makes coded 
image signals decodable by decoding processes corre- 
sponding to a single coding method, which coded image 
signals have different data structures obtained by cod- 
ing digital image signals corresponding to different 
schemes. Further, the invention relates to an image 
coding method and an image coding apparatus for gen- 
erating a coded image signal having the above- 
described data structure for image transmission, and an 
image decoding method and an image decoding appa- 
ratus for decoding a coded image signal having the data 
structure for image transmission. 

Furthermore, the invention relates to a data storage 
medium containing a program for implementing the 
decoding process according to the above-described 
image decoding method, and a data storage medium 
containing a coded image signal having the above- 
described data structure for image transmission. 

BACKGROUND OF THE INVENTION 

In order to store or transmit digital image informa- 
tion with high efficiency, it is necessary to compressively 
code the digital image information. As a typical method 
for compressive coding of digital image information, 
there is DCT (Discrete Cosine Transformation) repre- 
sented by JPEG (Joint Photographic Experts Group) 
and MPEG (Moving Picture Experts Group). Besides, 
there are waveform coding methods such as sub-band 
coding, wavelet coding, and fractal coding. 

Further, in order to eliminate redundant image infor- 
mation between display images, such as adjacent 
frames, inter-frame prediction using motion compensa- 
tion is carried out. That is, a pixel value of a pixel in the 
present frame is expressed using a difference between 
this pixel value and a pixel value of a pixel in the previ- 
ous frame, and this difference signal is subjected to 
waveform coding. 

To be specific, an arithmetic encoder 10a as shown 
in figure 22(a) is employed for coding a binary image 
signal S2 which represents a display image of binary 
information and is obtained from a scanner of a facsim- 
ile machine or the like. For decoding a coded binary sig- 
nal E2, an arithmetic decoder 10b is employed as 
shown in figure 22(b). The arithmetic encoder 10a 
encodes the binary image signal S2 by an arithmetic 



coding process which is used when transmitting a fac- 
simile signal, such as MMR (Modified Modified Reed) or 
JBIG (Joint Bi-level Image Coding Experts Group), 
thereby generates a coded binary signal E2. The arith- 

5 metic decoding apparatus 10b decodes the coded 
binary signal E2 by an arithmetic decoding process cor- 
responding to the arithmetic coding process, thereby 
regenerates a decoded binary signal D2. 

As shown in figure 22(c), a coded binary signal 

w 600a (E2) corresponding to one display image includes 
a synchronous signal 601 at the beginning, a header 
603 that follows signal 601 , and shape data 604 that fol- 
lows the header 603. 

Further, an image coding apparatus 20 shown in 

rs figure 23(a) is employed for coding a digital image sig- 
nal St treated in MPEG2, and an image decoding appa- 
ratus 25 shown in figure 23(b) is employed for decoding 
a coded image signal Et. The digital image signal St 
treated in MPEG2 is a rectangle image signal which 

20 includes a luminance signal and a color difference sig- 
nal for color display (gradation display), and information 
showing the horizontal and vertical size of an image on 
one display image (one frame). The image coding appa- 
ratus 20 comprises an information source encoder 20a 

ss which subjects the digital image signal (rectangle image 
signal) St to information source coding, and a variable- 
length encoder 20b which subjects the output from the 
encoder 20a to variable-length coding to generate a 
coded image signal (coded pixel value signal) Et. The 

30 image decoding apparatus 25 comprises a variable- 
lenglh decoder 25b which subjects the coded image 
signal Et to variable-length decoding, and an informa- 
tion source decoder 25a which subjects the output from 
the decoder 25b to information source decoding to gen- 

35 erate a decoded image signal (decoded pixel value sig- 
nal) Dt. 

The information source encoder 20a comprises a 
DCT processor 21 which subjects each of plural blocks, 
into which a display image (one frame) is divided, to 

w DCT (Discrete Cosine Transform), and a quantizer 22 
which quantizes the output from the DCT processor 21 . 
The information source decoder 25a comprises an 
inverse quantizer 26 which inversely quantizes the out- 
put from the variable-length decoder 25b, and an IDCT 

4s processor 27 which subjects the output from the 
decoder 25b to inverse DCT. As shown in figure 23(c), a 
coded image signal 700a (Et) corresponding to one dis- 
play image includes a 32-bit synchronous signal 701 at 
the beginning, a header 703 that follows signal 701 , and 

so coded pixel value bit streams (coded texture bit 
streams) 71C1, 71C2, 71 C3, ... corresponding to blocks 

C1, C2, C3 into which the display image is divided, 

respectively. The coded texture bit streams 71 C1, 
71C2, and 71C3 include 5-bit quantization scales 704. 

55 707. and 710. variable-length texture motion vectors 
(MV) 705, 708, and 711, and variable-length texture 
DCT coefficients 706, 709, and 712, respectively. 

In recent years, a method for compressively coding 
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and transmitting an image signal in the basis of individ- 
ual object has been put to practical use. More specifi- 
cally, an image, corresponding to one display image and 
composed of plural objects, is subjected to compressive 
coding and transmission in an object-wise manner, 5 
thereby increases the data compression ratio and ena- 
bles decoding/reproducing of the objects separately. In 
this method, on the reproduction end, coded image sig- 
nals corresponding to the respective objects are 
decoded and reproduced, and the reproduced image w 
signals are synthesized to display an image corre- 
sponding to one display image. This object-by-object 
coding enables the user to freely combine images of 
objects to be displayed, whereby editing of a moving 
picture is facilitated. Furthermore, in this method, it is 15 
possible to display a moving picture without reproducing 
images of relatively unimportant objects, according to 
the congestion of the transmission line, the perform- 
ance of reproduction apparatus, and the preference of 
the viewer. In other words, scalability in object units, i.e., so 
to change the contraction scale of image display for 
each object, is realized. 

In the object-by-object compressive coding of an 
image signal, since the respective objects have different 
shapes, an image signal of an arbitrary shape image 25 
(hereinafter, referred to as an arbitrary shape image sig- 
nal) is subjected to compressive coding. The arbitrary 
shape image signal includes a texture signal (pixel 
value signal) for color display of an object (gradation dis- 
play) and comprising a luminance signal and a color dif- 30 
ference signal, and a shape signal representing the 
shape of an image. The shape signal indicates whether 
each pixel as a component of a display region is located 
outside the object or inside the object, and it is 
expressed by binary digit. 35 

Further, there is a case where the arbitrary shape 
image signal includes transparency information repre- 
senting the transparency of an object when the object is 
placed as a foreground image on a background image, 
in addition to the texture signal and the shape signal, to 
The transparency information is usually expressed by a 
multivalued transparent signal of at least three bits. A 
combination of the binary shape signal (binary transpar- 
ency signal) and the multivalued transparency signal is 
called a transparency signal. The multivalued transpar- 45 
ency signal in the transparency signal is treated identi- 
cally to a texture signal in the following coding process. 

When an arbitrary shape image signal including 
both of a texture signal and a binary shape signal is 
coded, initially, the shape signal is coded and, thereaf- so 
ter, the texture signal is coded. In MPEG4, coding, 
transmission, and decoding of such an arbitrary shape 
image signal are being standardized, and figure 24(a) is 
a block diagram showing an image coding apparatus 
which performs a coding process currently being stand- 55 
ardized as MPEG4. 

In figure 24(a), reference numeral 200a designates 
an image coding apparatus which extracts an arbitrary 



shape image signal Sp corresponding to each of plural 
objects constituting a display image, according to a 
video signal Sv output from a camera or an image 
recording/reproduction apparatus (VTR), and encodes 
the arbitrary shape image signal. 

The image coding apparatus 200a includes a chro- 
makey processor 201 which subjects the video signal 
Sv to a chromakey process as follows. Initially, the chro- 
makey processor 201 separates an arbitrary shape 
image signal corresponding to each object from a back- 
ground image signal to a shape signal Spk representing 
the shape of the object as binary information and a tex- 
ture signal (pixel value signal) Spt for color display of the 
object and comprising a luminance signal and a color 
difference signal. Then, the chromakey processor 201 
outputs the signals Spk and Spt for each of plural blocks 
into which a display region corresponding to each object 
on the display image is divided. When outputting the 
signals Spk and Spt, the chromakey processor 201 out- 
puts a switch timing signal Ts representing the timing of 
switching between the shape signal Spk and the texture 
signal Spt. Further, the image coding apparatus 200a 
includes an arithmetic encoder 120a which codes the 
shape signal Spk, block by block, by arithmetic coding 
(refer to JBIG); an information source encoder 130a 
which performs DCT and quantization of the texture sig- 
nal Spt block by block; and a variable-length encoder 
139 which performs variable-length coding of the output 
from the information source encoder 130a. 

Further, the image coding apparatus 200a includes 
a switch 202 which connects the output from the chro- 
makey processor 201 with one of the input of the arith- 
metic encoder 120a and the input of the information 
source encoder 130a, in response to a switch timing 
signal Ts; and a multiplexer 150 which multiplexes a 
coded shape bit stream Epk output from the arithmetic 
encoder 1 20a and a coded texture bit stream Ept output 
from the variable-length encoder 139 together with 
other necessary signals. As shown in figure 24(c), a 
coded arbitrary shape signal Ep (500a), in which coded 
shape bit streams (Epk) 51 A1, 51 A2, and 51 A3, coded 
texture bit streams (Ept) 52 A1, 52 A2, and 52 A3, and 
other necessary signals are arranged in prescribed 
order, is output from the multiplexer 150. 

The above-described arithmetic coding process is 
adopted in a method of transmitting a facsimile signal, 
such as MMR or JBIG, and the above-described DCT 
process is adopted in the MPEG standard. As shown in 
figure 24(c). the coded data of each of blocks A1. A2 
and A3 is composed of the coded shape bit stream Epk 
and the coded texture bit stream Ept. 

In the coding apparatus 200a so constructed, the 
video signal Sv is processed by the chromakey proces- 
sor 201 , and an arbitrary shape image signal Sp corre- 
sponding to each object is output from the chromakey 
processor 201 . The shape signal Spk included in the 
arbitrary shape image signal Sp is input to the arithme- 
tic encoder 120a by the switch 202 controlled by the 
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switch timing signal Ts, coded by the encoder 120a, and 
output as a coded shape bit stream Epk toward the mul- 
tiplexer 150. On the other hand, the texture signal Spt 
included in the arbitrary shape image signal Sp is input 
to the information source encoder 130a by the switch 
202 controlled by the switch timing signal Ts, subjected 
to DCT and quantization in the encoder 1 30a, and out- 
put as a coded texture bit stream Ept toward the multi- 
plexer 150. The coding of the shape signal Spk and the 
coding of the texture signal Spt are carried out block by 
block. 

In the multiplexer 150, the coded shape bit stream 
Epk, the coded texture bit stream Ept, and other 
required signals are arranged in prescribed order, and 
these streams and signals are output from the multi- 
plexer 150 as a coded arbitrary shape signal Ep. 

The image decoding apparatus 200b shown in fig- 
ure 24(b) is used for decoding of the arbitrary shape sig- 
nal Ep which has been coded by the image coding 
apparatus 200a. 

The image decoding apparatus 200b comprises a 
data analyzer 1 60 which analyzes the coded arbitrary 
shape signal Ep and outputs a control signal SWb; an 
arithmetic decoder 170a which subjects the coded 
shape bit stream Epk included in the coded arbitrary 
shape signal Ep to block-by-block arithmetic decoding, 
and generates an end timing signal Te when arithmetic 
decoding of one block has ended; an information source 
decoder 180a which subjects the coded texture bit 
stream Ept included in the coded arbitrary shape signal 
Ep to information source decoding, i.e., inverse DCT 
and inverse quantization; a switch 101b which supplies 
the coded arbitrary shape signal Ep output from the 
data analyzer 160 to either the arithmetic decoder 170a 
or the information source decoder 180a, in response to 
the control signal SWb and the end timing signal Te; and 
a synthesizer 190 which synthesizes outputs Dpk and 
□pt from the decoders 170a and 180a and outputs the 
synthesized signal as a decoded arbitrary shape signal 
Dp. 

In the image decoding apparatus 200b so con- 
structed, receiving the coded arbitrary shape signal Ep, 
the data analyzer 160 analyzes information included in 
this signal, and outputs the control signal SWb toward 
the switch 1 01 b when it detects the last bit of the texture 
bit stream Ept. In response to the control signal SWb, 
the switch 101b supplies the output from the data ana- 
lyzer 1 60 to the arithmetic decoder 1 70a. The arithmetic 
decoder 1 70a decodes the coded shape bit stream Epk, 
and outputs the end timing signal Te when decoding of 
the coded shape bit stream Epk corresponding to each 
block has ended. When the end timing signal Te is input 
to the data analyzer 1 60, the data analyzer 1 60 outputs 
the control signal SWb to the switch 101b, and the 
switch 101b connects the output of the data analyzer 
160 to the information source decoder 180a. The infor- 
mation source decoder 1 80a decodes one block of the 
coded texture signal Ept included in the coded arbitrary 



shape signal Ep. The synthesizer 1 90 synthesizes the 
output from the arithmetic decoder 170a and the output 
from the information source decoder 180a, and outputs 
the decoded arbitrary shape signal Dp as a reproduced 

5 signal. When the above-mentioned decoding process 
has been completed for the coded arbitrary shape sig- 
nal Ep corresponding to one object to generate the 
decoded arbitrary shape signal Dp corresponding to the 
object, image display of the object is possible. 

10 Although no description is given of processing of an 
arbitrary shape image signal including a multivalued 
transparency signal (arbitrary shape image signal with 
transparency information) with respect to figures 24(a)- 
24(c), the multivalued transparency signal is processed 

is in the same manner as the texture signal (pixel value 
signal) if the arbitrary shape image signal includes a 
multivalued transparency signal. 

As described above, in the conventional method of 
coding an image signal, the arithmetic coding method 

so employed in JBIG or the like is used for coding of a 
binary image signal, i.e., a binary shape signal, while 
the information source coding method employed in 
MPEG2 or the like is used for coding of a digital image 
signal for color display of an image, i.e., a texture signal. 

25 Further, the coding method of MPEG4, i.e., combination 
of arithmetic coding and information source coding is 
used for coding of an image signal including a binary 
shape signal and a texture signal. 

It is a matter of course that, when decoding coded 

30 signals obtained by different coding methods, different 
decoding methods corresponding to the respective cod- 
ing methods should be employed. In other words, since 
different coding melhods are used for coding different 
kinds of image signals in the conventional coding proc- 

35 ess, different data analysis methods should be 
employed in the decoding process of the coded signals. 

Although an image decoding apparatus based on 
MPEG4 is able to decode any of coded signals of a 
binary image signal (JBIG), a digital image signal 

40 (MPEG2), and an arbitrary shape image signal 
(MPEG4), this apparatus has the following drawbacks. 

In a coded arbitrary shape signal obtained by block- 
by-block coding of an arbitrary shape image signal, 
coded shape bit streams corresponding to the respec- 
ts five blocks and coded texture bit streams corresponding 
to the respective blocks are alternatingly arranged. On 
the other hand, in a coded binary signal obtained by 
block-by-block coding of a binary image signal, coded 
shape bit streams corresponding to the respective 

so blocks are successively arranged. Therefore, in the data 
analysis method (data analyzer) according to MPEG4, 
the above-described control signal SWb cannot be gen- 
erated by analysis of the coded binary image signal 
containing no coded texture bit stream, as such the 

55 coded shape bit streams of the respective blocks in the 
coded binary signal cannot be successively output 
toward the arithmetic decoder 170a. To be specific, in 
the image decoding apparatus 200b, when processing 
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of a coded shape bit stream corresponding to some 
block has ended, the end timing signal Te is output from 
the arithmetic decoder 170a toward the switch 101b, 
whereby the switch 101b supplies the bit stream from 
the data analyzer 1 60 to the information source decoder . 5 
180a. However, since no coded texture bit stream is 
included in the coded binary signal, the data analyzer 
160 cannot generate the control signal SWb tor control- 
ling the switch 101b so that the bit stream is input to the 
arithmetic decoder 1 70a, and a coded shape bit stream 10 
corresponding to the next block is input to the informa- 
tion source decoder 1 80a. 

Hence, to support coding of the binary image sig- 
nal, a dummy texture bit stream corresponding to the 
block is added using the conventional method after the is 
coded shape bit stream of each block, thereby making 
the data structure of the coded binary signal apparently 
identical to the data structure of the coded arbitrary 
shape signal. In this case, the coded binary signal can 
be analyzed by the data analysis method based on 20 
MPEG4 and decoded by the image decoding process 
based on MPEG4. 

However, since the coded dummy texture bit stream 
is added to the coded shape bit stream when the coded 
binary image signal is output, the bit number is wasted 25 
in the coding process, resulting in a reduction in the 
coding efficiency. 

In the decoding process based on MPEG4, a coded 
image signal (coded pixel value signal), which is 
obtained by coding a digital image signal (rectangle 30 
image signal) corresponding to MPEG2 and comprising 
only a texture signal, is decodable as well as a coded 
arbitrary shape signal obtained by coding an arbitrary 
shape image signal. The reason is as follows. In a 
coded image signal including coded texture bit streams, 35 
since starting point and end point of the coded texture 
bit stream corresponding to each block is detectable, 
the switch 101b can be controlled by the control signal 
SWb so that the coded texture bit stream is always input 
to the information source decoder 1 80b. to 

Further, in the decoding process corresponding to 
the coding process based on MPEG4, if the processing 
is overloaded, when decoding a coded arbitrary shape 
signal obtained by coding an arbitrary shape image sig- 
nal, both of the coded shape bit stream and the coded 45 
texture bit stream corresponding to each block cannot 
be decoded within a display time that is set in advance, 
as such the motion of image on the display lacks of 
smoothness or stops. 

SUMMARY OF THE INVENTION 

An object of the present invention is to provide a 
data structure for image transmission, an image decod- 
ing method, and an image decoding apparatus, which 55 
enable decoding of coded image signals having differ- 
ent data structures obtained by coding digital image sig- 
nals corresponding to different schemes or the like, by 



decoding processes corresponding to a single coding 
method, for example, those capable of decoding both of 
a binary image signal and an arbitrary shape image sig- 
nal, without increasing the bit number during the coding 
process. 

Another object of the present invention is to provide 
a data storage medium containing a program for imple- 
menting the decoding processes according to the 
above-described image decoding method, and a data 
storage medium containing a coded image signal hav- 
ing the above-mentioned data structure for image trans- 
mission. 

Still another object of the present invention is to 
provide an image coding method and an image coding 
apparatus, which can create a coded image signal hav- 
ing a data structure for image transmission, which data 
structure enables decoding of coded image signals hav- 
ing different data structures obtained by coding digital 
image signals corresponding to different schemes or 
the like by decoding processes corresponding to a sin- 
gle coding method, for example, decoding of a binary 
image signal and an arbitrary shape image signal with- 
out increasing the bit number during the coding proc- 
ess. 

Yet another object of the present invention is to pro- 
vide an image decoding apparatus which can reproduce 
a coded image signal by decoding while maintaining a 
smooth motion of image on a display screen even when 
the load of an arithmetic processor performing the 
decoding is large. 

Other objects and advantages of the invention will 
become apparent from the detailed description that fol- 
lows. The detailed description and specific embodi- 
ments described are provided only for illustration since 
various additions and modifications within the scope of 
the invention will be apparent to those of skill in the art 
from the detailed description. 

According to a first aspect of the present invention, 
there is provided a data structure for transmitting a 
coded image signal obtained by coding a digital image 
signal, including: 

at least coded shape bit streams between the fol- 
lowing two kinds of bit streams: coded shape bit 
streams obtained by coding a shape signal which 
represents a display image of binary information or 
the shape in binary format of each object as one of 
the components of a display image; and coded pixel 
value bit streams obtained by coding a pixel value 
signal representing the gradation of the object; and 
an image identifier for deciding whether the coded 
image signal includes, as the coded bit streams, 
both of the coded shape bit streams and the coded 
pixel value bit streams, or only the coded shape bit 
streams; 

wherein the image identifier and the coded bit 
streams are arranged so that the image identifier is 
followed by the coded bit streams. 
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Therefore, by referring to the image identifier, a 
coded signal including no coded pixel value bit streams 
can be analyzed using a data analysis method for a 
coded signal including coded pixel value bit streams. 

According to a second aspect of the present inven- 
tion, in the above-described data structure for image 
transmission, the image identifier comprises a 2-bit 
code. Therefore, it is possible to realize decoding proc- 
esses for four kinds of coded image signals including a 
coded binary signal and a coded arbitrary shape signal, 
corresponding to a single coding method. 

According to a third aspect of the present invention, 
there is provided an image coding method receiving a 
digital image signal, and subjecting the digital image 
signal to a coding process according to the data struc- 
ture of the digital image signal, wherein: 

it is decided whether the digital image signal is a 
binary image signal including, as data for display, 
only a shape signal representing a display image of 
binary information, or an arbitrary shape image sig- 
nal including, as data for display, both of a shape 
signal representing the shape of each object as one 
of the components of a display image and a pixel 
value signal representing the gradation of the 
object; 

for the binary image signal, the shape signal is sub- 
jected to a first coding process and, in the coding 
process, an image identifier having a first value is 
generated, thereby creating a coded binary signal 
including the image identifier; 
for the arbitrary shape image signal, the shape sig- 
nal is subjected to the first coding process while the 
pixel value signal is subjected to a second coding 
process employing a coding method different from 
that of the first coding process and, in the coding 
process, an image identifier having a second value 
is generated, thereby creating a coded arbitrary 
shape signal including the image identifier; and 
in response to the input digital image signal, one of 
the coded binary signal and the coded arbitrary 
shape signal is output. 

Therefore, even though image signals having differ- 
ent data structures are coded by different coding meth- 
ods, these coded image signals can be decoded in 
decoding processes corresponding to a single coding 
method. In addition, the bit number hardly increases 
during the coding process. 

According to a fourth aspect of the present inven- 
tion, in the above-described image coding method, the 
image identifier comprises a 2-bit code. Therefore, cod- 
ing processes for four kinds of image signals including a 
binary image signal and an arbitrary shape image signal 
are carried out so that coded signals corresponding to 
these image signals can be identified at the decoding 
end. 

According to a fifth aspect of the present invention, 



there is provided an image coding apparatus receiving a 
digital image signal, and subjecting the digital image 
signal to a coding process according to the data struc- 
ture of the digital image signal, comprising: 

5 

signal identification means for receiving the digital 
image signal, and deciding whether the digital 
image signal is a binary image signal including, as 
data for display, only a shape signal showing a dis- 

w play image of binary information, or an arbitrary 
shape image signal including, as data for display, 
both of a shape signal showing the shape of each 
object as one of the components of a display image 
and a pixel value signal representing the gradation 

is of the object, and outputting an identifier signal 
according the result of the decision; 
signal extraction means for extracting the shape 
signal from the binary image signal, and the shape 
signal and the pixel value signal from the arbitrary 

20 shape image signal; 

first coding means for coding the shape signals by 
a first coding process to generate coded shape bit 
streams; 

second coding means for coding the pixel value sig- 
25 nal by a second coding process employing a coding 
method different from that of the first coding proc- 
ess, thereby generating coded pixel value bit 
streams; 

signal supply means for selecting, according to the 

30 identifier signal , one of first and second operations, 
where the first operation is to supply the shape sig- 
nal of the binary image signal to the first coding 
means and the second operation is to supply the 
shape signal of the arbitrary shape image signal to 

35 the first coding means while supplying the pixel 
value signal of the arbitrary shape image signal to 
the second coding means; and 
multiplexing means for multiplexing the identifier 
signal from the signal identification means, the 

<to coded shape bit streams output from the first cod- 
ing means, and the coded pixel value bit streams 
output from the second coding means; 
wherein a coded binary signal including the identi- 
fier signal and the coded shape bit streams is out- 

45 put when the binary image signal is input as the 
digital image signal, and a coded arbitrary shape 
signal including the identifier signal, the coded 
shape bit streams, and the coded pixel value bit 
streams is output when the arbitrary shape image 

so signal is input as the digital image signal. 

Therefore, even though image signals having differ- 
ent data structures are coded by different coding meth- 
ods, these coded signals can be decoded in decoding 
55 processes corresponding to a single coding method. In 
addition, the bit number hardly increases during the 
coding process. 

According to a sixth aspect of the present invention, 
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there is provided an image decoding method receiving, 
as a coded signal obtained by coding a digital image 
signal, a coded image signal having an image identifier 
according to the data structure of the digital image sig- 
nal, and subjecting the coded image signal to a decod- 5 
ing process according to the data structure, wherein: 

the coded image signal is analyzed with reference 
to the image identifier to decide whether the coded 
image signal is a coded arbitrary shape signal to 
including, as data for display, both of coded shape 
bit streams obtained by coding a shape signal 
which represents the shape of each object as one 
of the components of a display image and coded 
pixel value bit streams obtained by coding of a pixel >5 
value signal representing the gradation of the 
object, or a coded binary signal including, as data 
for display, only coded shape bit streams obtained 
by coding a shape signal representing a display 
image of binary information; 20 
when the input coded image signal is the coded 
arbitrary shape signal, decoding of the coded 
shape bit streams thereof by a first decoding 
method, and decoding of the coded pixel value bit 
streams thereof by a second decoding method, are 25 
carried out according to the arrangement of these 
bit streams; and 

when the input coded image signal is the coded 
binary signal, only decoding of the coded shape bit 
streams thereof by the first coding method is car- 30 
ried out. 

Therefore, even though image signals having differ- 
ent data structures are coded by different coding meth- 
ods, these coded image signals can be decoded in 35 
decoding processes corresponding to a single coding 
method. In addition, the bit number hardly increases 
during the coding process. 

According to a seventh aspect of the present inven- 
tion, in the above-described image decoding method. 10 
the image identifier comprises a 2-bit code. Therefore, it 
is possible to decode four kinds of coded image signals 
including a coded binary signal and a coded arbitrary 
shape signal, in decoding processes corresponding to a 
single coding method. 45 

According to an eighth aspect of the present inven- 
tion, there is provided an image decoding apparatus 
receiving, as a coded signal obtained by coding a digital 
image signal, a coded image signal having an image 
identifier according to the data structure of the digital so 
image signal, and subjecting the coded image signal to 
a decoding process according to the data structure, 
comprising: 

data analysis means for deciding whether the 55 
coded image signal is a coded arbitrary shape sig- 
nal including, as data for display, both of coded 
shape bit streams obtained by coding a shape sig- 
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nal which represents the shape of each object as 
one of the components of a display image and 
coded pixel value bit streams obtained by coding a 
pixel value signal representing the gradation of the 
object, or a coded binary signal including, as data 
for display, only coded shape bit streams obtained 
by coding a shape signal representing a display 
image of binary information; 
first decoding means for decoding the coded shape 
bit streams by a first decoding process to generate 
a decoded shape signal; 

second decoding means for decoding the coded 
pixel value bit streams by a second decoding proc- 
ess employing a decoding method different from 
that of the first decoding process, thereby generat- 
ing a decoded pixel value signal; 
signal switch means for supplying the coded image 
signal to one of the first and second decoding 
means, according to a switch control signal; 
the data analysis means supplying a first switch 
control signal to the signal switch means when the 
coded image signal is the coded binary signal, and 
supplying a second switch control signal to the sig- 
nal switch means when the coded image signal is 
the coded arbitrary shape signal; and 
the signal switch means being in the fixed switching 
state where the coded shape bit streams of the 
coded binary signal are input to the first decoding 
means alone when it receives the first switch con- 
trol signal, and being in the alternate switching state 
where, according to the arrangement of the coded 
shape bit streams and the coded pixel value bit 
streams in the coded arbitrary shape signal, the 
coded shape bit streams are input to the first 
decoding means while the coded pixel bit streams 
are input to the second deciding means, when it 
receives the second switch control signal. 

Therefore, even though image signals having differ- 
ent data structures are coded by different coding meth- 
ods, these coded image signals can be decoded in 
decoding processes corresponding to a single coding 
method. In addition, the bit number hardly increases 
during the coding process. 

According to a ninth aspect of the present inven- 
tion, the above-described image decoding apparatus 
further comprises over-load detection means for detect- 
ing that the load on the decoding process by one of the 
first and second decoding means exceeds a threshold 
value which is set in advance, and outputting an over- 
load detection signal toward the data analysis means: 

wherein, when the over-load detection signal is 
input to the data analysis means, the data analysis 
means outputs a third switch control signal toward 
the signal switch means; and 
in response to the third switch control signal, the 
signal switch means supplies either of the coded 
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shape bit streams and the coded pixel value bit 
streams in the coded arbitrary shape signal to one 
of the first and second decoding means. 

Therefore, coded image signals having different 
data structures can be decoded in decoding processes 
corresponding to a single coding method, and the bit 
number hardly increases during the coding process. 
Further, even when a processor for decoding is over- 
loaded, the coded image signals are reproduced by the 
decoding processes while maintaining smooth motion 
of image on the display screen. 

According to a tenth aspect of the present inven- 
tion, the above-described image decoding apparatus 
further comprises control signal input means for input- 
ting a manual control signal from the outside to the data 
analysis means: 



Therefore, by referring to the image identifier, at 
least a coded image signal including coded shape bit 
streams and a coded image signal including no coded 
shape bit streams can be identified among various 

5 kinds of coded image signals. 

According to a twelfth aspect of the present inven- 
tion, there is provided an image decoding method 
receiving, as a coded signal obtained by coding a digital 
image signal, a coded image signal having an image 

w identifier according to the data structure of the digital 
image signal, and subjecting the coded image signal to 
a decoding process according to the data structure, 
wherein: 

15 the coded image signal is analyzed with reference 
to the image identifier to decide which of the follow- 
ing three coded signals is the coded image signal; 



wherein, when the data analysis means receives 
the manual control means, it outputs a third switch 20 
control signal toward the signal switch means; and 
in response to the third switch control signal, the 
signal switch means supplies the coded shape bit 
streams of the coded arbitrary shape signal to the 
first decoding means, and does not supply the 25 
coded pixel value bit streams to the second decod- 
ing means. 

Therefore, coded image signals having different 
data structures can be decoded in decoding processes 30 
corresponding to a single coding method, and the bit 
number hardly increases during the coding process. 
Further, when image data stored in a recording medium 
are retrieved, only decoding of coded shape bit streams 
is continued until an object image is found, whereby the 35 
data retrieval is carried out quickly. 

According to an eleventh aspect of the present 
invention, there is provided a data structure for transmit- 
ting a coded image signal obtained by coding a digital 
image signal, including: 40 

at least one of the following two kinds of coded bit 
streams: coded shape bit streams obtained by cod- 
ing a shape signal which represents a display 
image of binary information or the shape in binary 45 
format of each object as one of the components of 
a display image, and coded pixel value bit streams 
obtained by coding a pixel value signal representing 
the gradation of the display image or the object; and 
an image identifier comprising a multiple-bit code, so 
for deciding whether the coded image signal 
includes, as the coded bit streams, at least the 
coded shape bit streams, or only the coded pixel 
value bit streams: 

wherein the image identifier and the coded bit 55 
streams are arranged so that the image identifier is 
followed by the coded bit streams. 



a coded arbitrary shape signal including, as 
data for display, both of coded shape bit 
streams obtained by coding a shape signal 
which represents the shape of each object as 
one of the components of a display image, and 
coded pixel value bit streams obtained by cod- 
ing a pixel value signal representing the grada- 
tion of the object, 

a coded pixel value signal including, as data for 
display, only coded pixel value bit streams 
obtained by coding a pixel value signal repre- 
senting the gradation of a display image, and 
a coded binary signal including, as data for dis- 
play, only coded shape bit streams obtained by 
coding a shape signal representing a display 
image of binary information; 

when the input coded image signal is the coded 
pixel value signal, decoding of the coded pixel value 
bit streams thereof is carried out; and 
when the input coded image signal is the coded 
binary signal or the coded arbitrary shape signal, 
decoding of the coded bit streams included in these 
signals is stopped. 

Therefore, among a coded binary signal, a coded 
arbitrary shape signal, and a coded rectangle signal 
(coded pixel value signal), only the coded rectangle sig- 
nal can be selected and decoded by a decoding appa- 
ratus for decoding only the coded rectangle signal 
which is obtained by coding a pixel value signal for color 
display of image. 

According to a thirteenth aspect of the present 
invention, there is provided an image decoding appara- 
tus receiving, as a coded signal obtained by coding a 
digital image signal, a coded image signal having an 
image identifier according to the data structure of the 
digital image signal, and subjecting the coded image 
signal to a decoding process according to the data 
structure, comprising: 
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data analysis means for analyzing the coded image 
signal according to the image identifier to decide 
which of the following three coded signals is the 
coded image signal; 

a coded arbitrary shape signal including, as 
data for display, both of coded shape bit 
streams obtained by coding a shape signal 
which represents the shape of each object as 
one of the components of a display image, and 
coded pixel value bit streams obtained by cod- 
ing a pixel value signal representing the grada- 
tion of the object, 

a coded pixel value signal including, as data for 
display, only coded pixel value bit streams 
obtained by coding a pixel value signal repre- 
senting the gradation of a display image, and 
a coded binary signal including, as data for dis- 
play, only coded shape bit streams obtained by 
coding a shape signal representing a display 
image of binary information; 

decoding means for decoding the coded pixel value 
bit streams to generate a decoded pixel value sig- 
nal; 

signal discarding means for discarding the coded 
image signal; 

signal switch means for supplying the coded image 
signal to one of the decoding means and the signal 
discarding means according to a switch control sig- 
nal; 

the data analysis means supplying a first switch 
control signal to the signal switch means when the 
coded image signal is the coded pixel value signal, 
and supplying a second switch control signal to the 
signal switch means when the coded image signal 
is the coded arbitrary shape signal or the coded 
binary signal; and 

the signal switch means supplying the coded pixel 
value bit streams of the coded pixel value signal to 
the decoding means when it receives the first 
switch control signal, and supplying the coded bit 
streams included in the coded arbitrary shape sig- 
nal and the coded binary signal to the signal dis- 
carding means when it receives the second switch 
control signal. 

Therefore, among a coded binary signal, a coded 
arbitrary shape signal, and a coded rectangle signal 
(coded pixel value signal), only the coded rectangle sig- 
nal can be selected and decoded by a decoding appa- 
ratus for decoding only the coded rectangle signal 
which is obtained by coding a pixel value signal for color 
display of image. 

According to a fourteenth aspect of the present 
invention, there is provided a data structure for transmit- 
ting a coded image signal obtained by coding a digital 
image signal, including: 



at least one of the following three kinds of coded bit 
streams: coded shape bit streams obtained by cod- 
ing a shape signal which represents a display 
image of binary information or the shape in binary 

5 format of each object as one of the components of 
a display image; coded pixel value bit streams 
obtained by coding a pixel value signal representing 
the gradation of the display image or the object; and 
coded transparency bit streams obtained by coding 

jo a multivalued transparency signal representing the 
transparency of the object; and 
an image identifier comprising a multiple-bit code, 
for deciding which of the following four coded image 
signals is the coded image signal; 

15 

a coded pixel value signal including, as the 
coded bit streams, only the coded pixel value 
bit streams, 

a first coded arbitrary shape signal including, 
20 as the coded bit streams, the coded shape bit 

streams and the coded pixel value bit streams, 
a coded binary signal including, as the coded 
bit streams, only the coded shape bit streams, 
and 

25 a second coded arbitrary shape signal includ- 

ing, as the coded bit streams, the coded shape 
bit streams, the coded pixel value bit streams, 
and the coded transparency bit streams; 

30 wherein the image identifier and the coded bit 
streams are arranged so that the image identifier is 
followed by the coded bit streams. 

Therefore, by referring to the image identifier, 
35 among various kinds of coded image signals, a coded 
image signal including at least one of coded shape bit 
streams, coded pixel value bit streams, and coded 
transparency bit streams can be identified. 

According to a fifteenth aspect of the present inven- 
40 tion. there is provided an image decoding method 
receiving, as a coded signal obtained by coding a digital 
image signal, a coded image signal having an image 
identifier according to the data structure of the digital 
image signal, and subjecting the coded image signal to 
45 a decoding process according to the data structure, 
wherein: 

the coded image signal is analyzed with reference 
to the image identifier to decide which of the follow- 
so ing four coded signals is the coded image signal; 

a first coded arbitrary shape signal including, 
as data for display, coded shape bit streams 
obtained by coding a shape signal representing 
55 the shape of each object as one of the compo- 

nents of a display image, and coded pixel value 
bit streams obtained by coding a pixel value 
signal representing the gradation of the object, 
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a second coded arbitrary shape signal includ- 
ing, as data for display, in addition to coded 
shape bit streams and coded pixel value bit 
streams, coded transparency bit streams 
obtained by coding a multivalued transparency 
signal representing the transparency of the 
object, 

a coded pixel value signal including, as data for 
display, only coded pixel value bit streams 
obtained by coding a pixel value signal repre- 
senting the gradation of a display image, and 
a coded binary signal including, as data for dis- 
play, only coded shape bit streams obtained by 
coding a shape signal representing a display 
image of binary information; 

when the input coded image signal is the first coded 
arbitrary shape signal, decoding of the coded 
shape bit streams thereof by a first decoding 
method, and decoding of the coded pixel value bit 
streams thereof by a second decoding method, are 
carried out according to the arrangement of these 
bit streams; 

when the input coded image signal is the second 
coded arbitrary shape signal, decoding of the 
coded shape bit streams thereof by the first decod- 
ing method, decoding of the coded pixel value bit 
streams thereof by the second decoding method, 
and decoding of the coded transparency bit 
streams thereof by a third decoding method, are 
carried out according to the arrangement of these 
bit streams; 

when the input coded image signal is the coded 
pixel value signal, only decoding of the coded pixel 
value bit streams thereof by the second decoding 
method is carried out; and 
when the input coded image signal is the coded 
binary signal, only decoding of the coded shape bit 
streams thereof by the first decoding method is car- 
ried out. 

Therefore, for example, the first and second coded 
arbitrary shape signal, the coded rectangle signal 
(coded pixel value signal), and the coded binary signal 
can be decoded by the first to third decoding methods 
corresponding to three coded bit streams included in 
the second coded arbitrary shape signal. 

According to a sixteenth aspect of the present 
invention, there is provided a data structure for transmit- 
ting a coded image signal obtained by coding a digital 
image signal, including: 

at least one of the following three kinds of coded bit 
streams: coded shape bit streams obtained by cod- 
ing a shape signal which represents a display 
image of binary information or the shape in binary 
format of each object as one of the components of 
a display image, coded pixel value bit streams 
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obtained by coding a pixel value signal representing 
the gradation of the display image or the object, and 
coded transparency bit streams obtained by coding 
a multivalued transparency signal representing the 
5 transparency of the object; and 

an image identifier comprising a multiple-bit code, 
for deciding which of the following live coded image 
signals is the coded image signal; 

to a coded pixel value signal including, as the 

coded bit streams, only the coded pixel value 
bit streams, 

a first coded arbitrary shape signal including, 
as the coded bit streams, the coded shape bit 

15 streams and the coded pixel value bit streams, 

a coded binary signal including, as the coded 
bit streams, only the coded shape bit streams, 
a second coded arbitrary shape signal includ- 
ing, as the coded bit streams, the coded shape 

so bit streams, the coded pixel value bit streams, 

and the coded transparency bit streams, and 
a coded transparency signal including, as the 
coded bit streams, the coded shape bit streams 
and the coded transparency bit streams; 

25 

wherein the image identifier and the coded bit 
streams are arranged so that the image identifier is 
followed by the coded bit streams. 

30 Therefore, by referring to the image identifier, 
among various kinds of coded image signals, a coded 
image signal including at least one of coded shape bit 
streams, coded pixel value bit streams, and coded 
transparency bit streams can be identified. 

35 According to a seventeenth aspect of the present 
invention, there is provided an image decoding method 
receiving, as a coded signal obtained by coding a digital 
image signal, a coded image signal having an image 
identifier according to the data structure of the digital 

40 image signal, and subjecting the coded image signal to 
a decoding process according to the data structure, 
wherein: 

the coded image signal is analyzed with reference 
45 to the image identifier to decide which of the follow- 
ing five coded signals is the coded image signal; 

a first coded arbitrary shape signal including, 
as data for display, coded shape bit streams 

so obtained by coding a shape signal representing 

the shape of each object as one of the compo- 
nents of a display image, and coded pixel value 
bit streams obtained by coding a pixel value 
signal representing the gradation of the object, 

55 a second coded arbitrary shape signal includ- 

ing, as data for display, in addition to coded 
shape bit streams and coded pixel value bit 
streams, coded transparency bit streams 
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obtained by coding a multivalued transparency 
signal representing the transparency of the 
object, 

a coded transparency signal including, as data 
for display, coded shape bit streams obtained 5 
by coding a shape signal representing the 
shape of the object, and coded transparency 
bit stream obtained by coding a multivalued 
signal representing the transparency of the 
object, 10 
a coded binary signal including, as data for dis- 
play, only coded shape bit streams obtained by 
coding a shape signal representing a display 
image of binary information, and 
a coded pixel value signal including, as data for ;s 
display, only coded pixel value bit streams 
obtained by coding a pixel value signal repre- 
senting the gradation of a display image; and 

when the input coded image signal is the first coded 20 
arbitrary shape signal, decoding of the coded 
shape bit streams thereof by a first decoding 
method, and decoding of the coded pixel value bit 
streams thereof by a second decoding method, are 
carried out according to the arrangement of these 25 
bit streams; 

when the input coded image signal is the second 
coded arbitrary shape signal, decoding of the 
coded shape bit streams thereof by the first decod- 
ing method, decoding of the coded pixel value bit 30 
streams thereof by the second decoding method, 
and decoding of the coded transparency bit 
streams thereof by a third decoding method, are 
carried out according to the arrangement of these 
bit streams; 35 
when the input coded image signal is the coded 
transparency signal, decoding of the coded shape 
bit streams thereof by the first decoding method, 
and decoding of the coded transparency bit 
streams by the third decoding method, are carried 40 
out according to the arrangement of these bit 
streams; 

when the input coded image signal is the coded 
binary signal, only decoding of the coded shape bit 
streams thereof by the first decoding method is car- 45 
ried out; and 

when the input coded image signal is the coded 
pixel value signal, only decoding of the coded pixel 
value bit streams thereof by the second decoding 
method is carried out. so 

Therefore, for example, the coded arbitrary shape 
signal, the coded rectangle signal (coded pixel value 
signal), the coded binary signal, and the coded trans- 
parency signal can be decoded by the first to third 55 
decoding methods corresponding to three coded bit 
streams included in the coded arbitrary shape signal 
with transparency information. 



According to an eighteenth aspect of the present 
invention, there is provided a data storage medium con- 
taining a program for implementing a decoding process 
of a coded image signal by a computer, the program 
being constructed so that a decoding process according 
to any of the above-described image decoding methods 
is executed by the computer. Therefore, when the pro- 
gram is loaded into the computer, the decoding process 
according to the above-described decoding method can 
be implemented by software. 

According to a nineteenth aspect of the present 
invention, there is provided a data storage medium con- 
taining a coded image signal obtained by coding a dig- 
ital image signal, the coded image signal having any of 
the above-described data structures for image transmis- 
sion. Therefore, when the stored coded image signals 
are read from the medium and decoded, these coded 
image signals, having different data structures, can be 
identified. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1(a) and 1(b) are diagrams showing a data 
structure of a coded arbitrary shape signal and a 
data structure of a coded binary signal, respec- 
tively, as data structures for image transmission 
according to a first embodiment of the invention. 
Figures 2(a) and 2(b) are diagrams showing a data 
structure of a coded arbitrary shape signal and a 
data structure of a coded binary signal, respec- 
tively, as data structures for image transmission 
according to a modification of the first embodiment. 
Figure 3 is a block diagram illustrating an image 
coding apparatus according to a second embodi- 
ment of the invention. 

Figure 4 is a flowchart for explaining a coding proc- 
ess by the image coding apparatus according to the 
second embodiment. 

Figure 5 is a block diagram illustrating an image 
decoding apparatus according to a third embodi- 
ment of the invention. 

Figure 6 is a flowchart for explaining a decoding 
process by the image decoding apparatus accord- 
ing to the third embodiment. 
Figure 7(a) is a block diagram showing an image 
input unil included in the image coding apparatus 
according to the second embodiment, and figure 
7(b) is a block diagram showing a data analyzer 
included in the image decoding apparatus accord- 
ing to the third embodiment. 
Figure 8 is a block diagram illustrating an image 
decoding apparatus according to a fourth embodi- 
ment of the invention. 

Figure 9 is a block diagram illustrating an image 
decoding apparatus according to the fifth embodi- 
ment of the invention. 

Figures 10(a), 10(b), and 10(c) are diagrams show- 
ing data structures of a coded arbitrary shape sig- 
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nal, a coded binary signal, and a coded rectangle 
signal, respectively, as data structures for image 
transmission according to a sixth embodiment of 
the invention. 

Figure 1 1 is a block diagram illustrating an image 
decoding apparatus according to a seventh embod- 
iment of the invention. 

Figure 1 2 is a flowchart for explaining a decoding 
process by the image decoding apparatus accord- 
ing to the seventh embodiment. 
Figures 13(a) and 13(b) are diagrams illustrating a 
data structure of a coded arbitrary shape signal and 
a data structure of a coded binary signal, respec- 
tively, as data structures for image transmission 
according to an eighth embodiment of the inven- 
tion. 

Figures 14(a) and 14(b) are diagrams illustrating a 
data structure of a coded rectangle signal and a 
data structure of a coded arbitrary shape signal 
with transparency information, respectively, as data 
structures for image transmission according to the 
eighth embodiment. 

Figure 1 5 is a block diagram illustrating an image 
decoding apparatus according to a ninth embodi- 
ment of the invention. 

Figure 16 is a flowchart for explaining a decoding 
process by the image decoding apparatus accord- 
ing to the ninth embodiment. 
Figures 17(a), 17(b), and 17(c) are data structures 
of a coded arbitrary shape signal, a coded binary 
signal, and a coded rectangle signal, respectively, 
as data structures for image transmission accord- 
ing to a tenth embodiment of the invention. 
Figures 18(a) and 18(b) are diagrams showing a 
data structure of a coded arbitrary shape signal 
with transparency information and a data structure 
of a coded transparency signal, respectively, as 
data structures for image transmission according to 
the tenth embodiment. 

Figure 1 9 is a block diagram illustrating an image 
decoding apparatus according to an eleventh 
embodiment of the invention. Figure 20 is a flow- 
chart for explaining a decoding process 
by the image decoding apparatus according to the 
eleventh embodiment. 

Figures 21(a)-21(c) are diagrams for explaining a 
data storage medium (figures 21(a),21(b)) which 
contains a program for implementing an image cod- 
ing apparatus or an image decoding apparatus 
according to any of the aforementioned embodi- 
ments, using a computer system (figure 21 (c)). 
Figures 22(a)-22(c) are diagrams for explaining a 
communication system using the JBIG type data 
compressive coding process and, more specifically, 
figures 22(a) and 22(b) show an image coding 
apparatus and an image decoding apparatus 
included in the system, respectively, and figure 
22(c) shows a data structure of a coded image sig- 
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nal adopted in the system. 
Figures 22(a)-22(c) are diagrams for explaining a 
communication system using a data compressive 
coding process based on JBIG, more specifically, 

5 figures 22(a) and 22(b) show an image coding 
apparatus and an image decoding apparatus 
included in the system, respectively, and figure 
22(c) shows a data structure of a coded image sig- 
nal adopted in the system. 

10 Figures 23(a)-23 (c) are diagrams for explaining a 
communication system using a data compressive 
coding process based on MPEG2, more specifi- 
cally, figures 23(a) and 23(b) show an image coding 
apparatus and an image decoding apparatus 

is included in the system, respectively, and figure 
23(c) shows a data structure of a coded image sig- 
nal adopted in the system. 
Figures 24(a)-24(c) are diagrams for explaining a 
communication system using a data compressive 

so coding process based on MPEG4, more specifi- 
cally, figures 24(a) and 24(b) show an image coding 
apparatus and an image decoding apparatus 
included in the system, respectively, and figure 
24(c) shows a data structure of a coded image sig- 

25 nal adopted in the system. 

DETAILED DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

30 [Embodiment 1] 

Figures 1(a) and 1(b) are diagrams illustrating data 
structures for image transmission according to a first 
embodiment of the present invention. Figure 1(a) shows 

35 a data structure of a coded arbitrary shape signal 
obtained by coding an arbitrary shape image signal cor- 
responding to each of plural objects constituting a dis- 
play image. The data structure for image transmission 
according to the first embodiment is applied to a coding 

40 and decoding system in which an arbitrary shape image 
signal is compressively coded and transmitted and, fur- 
ther, the coded signal is decoded and displayed. That is, 
in this system, the image coding method and apparatus 
are basically constructed to perform a coding process 

45 for the arbitrary shape image signal while the image 
decoding method and apparatus are basically con- 
structed to perform a decoding process for the coded 
arbitrary shape signal. 

In figure 1(a), reference numeral 500 designates a 

so coded arbitrary shape signal obtained by coding an 
arbitrary shape image signal corresponding to each of 
plural objects constituting an image of one display 
image. The coded arbitrary shape signal 500 includes a 
32-bit synchronous signal 501 at the beginning, a 1-bit 

55 shape identifier (image identifier) 502 that follows the 
signal 501 , and an additional header 503 that follows 
the identifier 502. In the figure, the shape identifier 502 
is represented as "SID". 



12 



23 EP 0 891 093 A2 24 



Further, the coded arbitrary shape signal 500 
includes coded shape bit streams 51 A which are 
obtained by coding a shape signal that represents the 
shape of each object as a constituent of the arbitrary 
shape image signal, and coded texture bit streams 5 
(coded pixel value bit streams) 52A which are obtained 
by coding a texture signal (pixel value signal) that com- 
prises a luminance signal and a color difference signal 
for color display of each object as a constituent of the 
arbitrary shape image signal. To be specific, in the 70 
coded arbitrary shape signal 500, for each of blocks into 
which an object region including an object on a display 
image is divided, a coded shape bit stream 51a and a 
coded texture bit stream 52A are arranged in this order. 

More specifically, following the header 503, 15 
arranged are a coded shape bit stream 51A1 and a 
coded texture bit stream 52A1 corresponding to a block 
A1 , a coded shape bit stream 51 A2 and a coded texture 
bit stream 52A2 corresponding to a block A2, and a 
coded shape bit stream 51 A3 and a coded texture bit 20 
stream 52A3 corresponding to a block A3. 

Further, the coded shape bit streams 51 A1, 51 A2, 
and 51 A3 comprise variable-length coded data 504, 
509, and 51 4 corresponding to shape motion vectors (in 
figure 1(a), "shape MV");.and variable-length coded zs 
data 505. 51 0, and 51 5 corresponding to binary shape 
signals, each representing whether pixels within the 
block are located inside the object or outside the object 
(in the figure, "shape data"), respectively. 

The coded texture bit streams 52A1, 52A2, and so 
52A3 comprise 5-bit coded data 506, 51 1 , and 516 cor- 
responding the quantization scales (in figure 1(a), 
"quantization scale"), variable-length coded data 507. 
512, and 517 corresponding to texture motion vectors 
(in the figure, "texture MV"); and variable-length coded 35 
data 508, 513, and 518 corresponding to quantized sig- 
nals obtained by subjecting the texture signal to DCT 
and quantization (in the figure, "texture DCT coeffi- 
cients"), respectively. 

The synchronous signal 501 is a signal indicating 40 
the head of a coded arbitrary shape signal correspond- 
ing to one object, and this is a unique coded signal. The 
shape identifier (SID) 502 is a signal indicating whether 
or not both of coded shape bit streams and coded tex- 
ture bit streams are included in the coded image signal. 45 
When the value of the shape identifier (SID) is 0, it indi- 
cates that both of the coded shape bit streams and the 
coded texture bit streams exist in the coded image sig- 
nal. When SID = 1, the shape identifier indicates that 
only the coded shape bit streams exist in the coded sig- so 
nal. Therefore, in the data structure of figure 1 (a), the 
shape identifier (SID) 502 is 0. The header 503 includes 
information relating to, for example, the time to display 
the image of the corresponding object, the attribute of 
the image, and the prediction mode tor coding. How- 55 
ever, these data have no relationship with the present 
invention and, therefore, do not require detailed descrip- 
tion. 



The shape MV is the coded data of a motion vector 
which represents a motion of an image within a block 
between the present frame and the previous frame, and 
this is used for prediction of a shape signal of a specific 
block in the present frame from a shape signal of the 
corresponding block in the previous frame. Further, the 
shape data is obtained by subjecting a shape signal to 
arithmetic coding, and the quantization scale is a 
parameter for inversely quantizing DCT coefficients 
which are obtained by subjecting a texture signal to 
DCT and quantization. The texture MV is the coded 
data of a texture motion vector which represents a 
motion of an image within a block between the present 
frame and the previous frame, and this is used for pre- 
diction of a texture signal in the present frame from a 
texture signal in the previous frame. Further, the texture 
DCT coefficients are obtained by subjecting the quan- 
tized texture signal to variable-length coding. 

In the coded arbitrary shape signal 500, actually, 
many pieces of side information (data) are arranged 
before the texture DCT coefficients although these data 
are not shown in the figure, and these side data some- 
times require a large number of bits. 

Figure 1(b) shows a data structure of a coded 
binary signal obtained by coding a binary image signal. 
In figure 1(b), reference numeral 600 designates a 
coded binary signal obtained by coding a binary image 
signal S2 that represents image information as a binary 
signal on one display image. The coded binary signal 
600 includes a 32-bit synchronous signal 601 at the 
beginning, a 1-bit shape identifier (image identifier) 602 
that follows the synchronous signal 601, and an addi- 
tional header 603 that follows the shape identifier 602. 
Further, the coded binary signal 600 includes coded 
shape bit streams 61 B obtained by coding shape sig- 
nals constituting the binary image signal S2. In figure 
1(b), the shape identifier is shown as SID. 

To be specific, in the coded binary shape signal 
600, the coded shape bit streams 61 B are successively 
arranged correspondingly to the respective blocks into 
which the display image is divided. More specifically, 
coded shape bit streams 61 B1, 61 B2. 61 B3. 61 B4, 
61 B5, ... respectively corresponding to the blocks B1, 
B2, B3, B4, B5, ... are arranged in this order. 

Further, the coded shape bit streams 61 B1 -61 B5 
comprise variable-length coded data 604, 606, 608, 
61 0, and 61 2 corresponding to shape motion vectors (in 
the figure, "shape MV"), and variable-length coded data 
605, 607, 609, 611, and 613 corresponding to shape 
signals representing the image on the display image by 
binary signals (in the figure, "shape data"), respectively. 

The synchronous signal 601, the shape identifier 
(SID) 602, and the header 603 are identical to the syn- 
chronous signal 501 , the shape identifier (SID) 502, and 
the header 503 in the coded arbitrary shape signal 500, 
respectively, and the data of the blocks B1-B5 ... are 
only the coded shape bit streams. Since the coded 
binary signal 600 includes no coded texture bit streams, 
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the shape identifier (SID) 602 is 1. 

In this first embodiment, the shape identifier (SID) 
502 in the coded arbitrary shape signal 500 and the 
shape identifier 602 in the coded binary signal 600 are 
disposed, for each frame, in the coded signals 500 and 5 
600, respectively. 

As shown in figures 1 (a) and 1 (b), the bit number of 
the coded data of the shape signal or the texture signal 
is variable because the bit number required for coding 
varies according to the image included in the display 10 
image or in the block of the display image. Therefore, it 
is impossible to predict the position of the quantization 
scale 506, i.e., how many bits exist between the head of 
the shape data 505 and the quantization scale 506. Fur- 
ther, whether the coded texture bit stream 52A is posi- is 
tioned after the coded shape bit stream 51 A or not 
cannot be decided without the shape identifier 502. 
Therefore, when coding and transmitting the shape sig- 
nal without the shape identifier, it is necessary to insert 
a dummy coded texture bit stream between the coded so 
shape bit streams of the adjacent blocks, but the inser- 
tion of the dummy coded texture bit stream increases 
the bit number when the shape signal is coded. By the 
way. instead of disposing at the header of each image, 
a shape identifier may be disposed at the head of the 
whole image sequence, or it may be disposed for a plu- 
rality of images. 

In MPEG4, besides the arbitrary shape image sig- 
nal and the binary image signal, transparency informa- 
tion representing the transparency of each pixel can be 
processed. Such transparency information is usually 
expressed by a signal of 3 bits or more, and it is separa- 
ble into a binary signal as shape information (binary 
transparency signal) and a multivalued transparency 
signal representing the transparency information 
excluding the shape information. In this case, the multi- 
valued transparency signal can be treated in the same 
manner as a texture signal. 

Hereinafter, a description is given of a process of 
coding a shape signal and a texture signal, which sig- 
nals are components of an arbitrary shape image sig- 
nal. 

Initially, with respect to a shape signal, an object 
region including an object on the display image is 
divided into plural blocks (shape blocks), and the shape 
signal is coded block by block. In this first embodiment, 
each shape block is a display region comprising 16x16 
samples (pixels), and a shape signal corresponding to 
each shape block is coded by the arithmetic coding 
method employed in JBIG. 

With respect to a texture signal, as described 
above, an object region including an object on the dis- 
play image is divided into plural blocks (texture blocks), 
and the texture signal is compressively coded block by 
block. In this first embodiment, each texture block is a 
display region comprising 16x16 samples (pixels). In 
the compressive coding process, DCT and quantization 
are carried out as follows. The texture block is divided 



into four display regions (sub-blocks) each comprising 8 
x8 samples (pixels), and the texture signal is subjected 
to DCT and quantization, sub-block by sub-block. Then, 
quantized signals provided by the above-described 
process are subjected to variable-length coding to gen- 
erate coded texture bit streams. 

The coded shape bit streams and the coded texture 
bit streams so generated are multiplexed together with 
other required signals, thereby creating a coded arbi- 
trary shape signal 500 having a data structure shown in 
figure 1(a). 

Further, a process of coding a shape signal as a 
constituent of a binary image signal in JBIG is almost 
identical to the above-described shape signal coding 
method employed for MPEG4. 

That is, a display image is divided into plural blocks 
(shape blocks), and the shape signal is coded block by 
block, thereby creating a coded binary signal 600 hav- 
ing a data structure shown in figure 1(b). In this first 
embodiment, the shape block is a display region com- 
prising 16x16 samples (pixels). A shape signal corre- 
sponding to each shape block is coded using the 
arithmetic coding method employed in JBIG. 

As described above, in the first embodiment of the 
invention, the coded binary signal corresponding to a 
binary image signal includes the coded shape bit 
streams 61 B obtained by coding the shape signal that 
represents the display image of binary information, and 
the shape identifier 602 indicating that only the coded 
shape bit streams are included as display data, it is pos- 
sible to analyze a coded binary signal including no 
coded texture bit streams, by referring to the image 
identifier, using a data analysis method adapted to a 
coded signal including coded texture bit streams. 

To be specific, using a data analysis method for 
analyzing a coded signal of an arbitrary shape image 
signal (coded arbitrary shape signal) including both of a 
shape signal and a texture signal (pixel value signal), a 
coded signal of a binary image signal including only a 
shape signal as display information (coded binary sig- 
nal) can be analyzed. Therefore, even though a dummy 
coded pixel value bit stream is not added to each coded 
shape bit stream when coding the binary image signal, 
the coded shape bit stream of the binary signal can be 
decoded by the decoding process based on MPEG4. 
That is, without increasing the number of bits in the cod- 
ing process, both the binary image signal and the arbi- 
trary shape image signal can be decoded by an image 
decoding apparatus adapted to a single coding method. 

As described above, the first embodiment of the 
invention employs, as a coded arbitrary shape signal 
and a coded binary signal, the coded arbitrary shape 
signal 500 and the coded binary signal 600 which are 
obtained by inter-frame predictive coding, respectively. 
It is needless to say that a coded arbitrary shape signal 
provided by intra-frame coding has a data structure 
excluding the shape MVs and texture MVs from the 
coded arbitrary shape signal 500 while a coded binary 
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signal provided by intra-frame coding has a data struc- 
ture excluding the shape MVs from the coded binary 
signal 600. 

Further, the first embodiment of the invention 
employs, as a coded arbitrary shape signal and a coded 
binary signal, the coded arbitrary shape signal 500 in 
which a coded shape bit stream and a coded texture bit 
stream are arranged for each block, and the coded 
binary signal 600 in which a coded shape bit stream is 
arranged for each block, respectively. However, a coded 
signal having a data structure for image transmission 
according to the invention is not restricted to those 
shown in figures 1(a) and 1(b). 

For example, a coded arbitrary shape signal 500b 
having a data structure shown in figure 2(a) may be 
employed, which data structure comprises, for each 
frame, a synchronous signal 501, a shape identifier 
(SID) 502, an additional header 503, a shape data part 
51, and a texture data part 52. Further, a coded binary 
signal 600b having a data structure shown in figure 2(b) 
may be employed, which data structure comprises, for 
each frame, a synchronous signal 601, a shape identi- 
fier (SID) 602, an additional header 603, and a shape 
data part 61. 

The shape data part 51 is obtained by collecting, for 
one frame, the coded shape bit streams corresponding 
to the respective blocks in the coded arbitrary shape 
signal as shown in figure 1(a). The texture data part 52 
is obtained by collecting, for one frame, the coded tex- 
ture bit streams corresponding to the respective blocks 
in the coded arbitrary shape signal shown in figure 1(a). 
Further, the shape data part 61 is obtained by collect- 
ing, for one frame, the coded shape bit streams corre- 
sponding to the respective blocks in the coded binary 
signal shown in figure 1(b). 

[Embodiment 2] 

Figure 3 is a block diagram illustrating an image 
coding apparatus according to a second embodiment of 
the present invention. In figure 3, reference numeral 
100a designates an image coding apparatus which cre- 
ates a coded image signal having a data structure for 
image transmission according to the first embodiment of 
the invention. This apparatus 100a is constructed as fol- 
lows. When the apparatus 100a receives, as an input 
image signal Sin, a binary image signal S2 for display- 
ing a binary image, from a scanner of a facsimile 
machine, it outputs a coded binary signal (E2) 600 
shown in figure 1(b) as a coded image signal Eo. On the 
other hand, when the apparatus 100a receives, as an 
input image signal Sin, a digital image signal Sp from a 
video camera or an image reproduction apparatus, it 
outputs a coded arbitrary shape signal (Ep) 500 shown 
in figure 1(a) as a coded image signal Eo for each 
object. 

The image coding apparatus 100a comprises an 
image input unit 1 10 which subjects the input image sig- 
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nal Sin to a pre-coding process to generate a binary 
shape signal and a texture signal, and outputs a switch 
control signal SWa; a shape coding unit (first coding 
means) 1 20 which subjects the shape signal to an arith- 

5 metic coding process; a texture coding unit (second 
coding means) 130 which subjects the texture signal to 
a coding process including DCT; a switch 101a which 
supplies the output from the image input unit 1 1 0 to one 
of the coding units 120 and 130, according to the switch 

10 control signal SWa; and a frame memory bank 102 
which stores a prediction shape signal and a prediction 
texture signal. 

The image input unit 1 1 0 is constructed as follows. 
The image input unit 110 distinguishes between a 

is binary image signal from a scanner or the like and a dig- 
ital video signal from a camera or an image record- 
ing/reproduction unit (VTR), and the binary image 
signal is subjected to thresholding to generate a shape 
signal while the digital video signal is subjected to a 

20 chromakey process to generate a texture signal. In the 
chromakey process, an arbitrary shape image signal 
corresponding to each object of a display image is sep- 
arated from an image signal of a background image. 
The arbitrary shape image signal separated is proc- 

25 essed to generate a shape signal representing the 
shape of the object, and a texture signal for gradation 
color display of the image and comprising a luminance 
signal and a color difference signal. Further, the image 
input unit 1 10 outputs an identifier signal (image identi- 

30 fier) Sdis that indicates which signal is processed 
between the binary image signal and the arbitrary 
shape image signal. 

To be specific, as shown in figure 7(a). the image 
input unit 1 10 comprises an identification unit 1 1 1 which 

35 decides whether the digital image input Sin is a binary 
image signal or an arbitrary shape image signal, and 
separates this signal; a thresholding unit 1 12 which sub- 
jects the binary image signal to thresholding for each of 
blocks which divide the display image, thereby creates a 

io binary shape signal; a chromakey unit 1 13 which sub- 
jects the arbitrary shape image signal to the chromakey 
process for each object region corresponding to each 
object on the display image, thereby generates a binary 
shape signal and a multivalued texture signal, and out- 

45 puts a timing signal T at the end of the processing for 
the texture signal; and a switch control circuit 1 14 which 
controls the switch 101a by the switch control signal 
SWa, on the basis of the identifier signal Sdis from the 
identification unit 111 and the timing signal T from the 

so chromakey unit 1 1 3. 

The shape coding unit 120 comprises a shape 
motion detector 121 which receives the shape signal 
S2k corresponding to the binary image or the shape sig- 
nal Spk corresponding to the arbitrary shape image, 

55 and generates a shape motion vector Skv by block 
matching or the like, for each block, while referring to a 
shape signal of a frame previous to the present frame 
which is being processed; and a shape motion compen- 
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sator 123 which transforms the shape motion vector 
Skv to an address of the frame memory bank 102a, and 
obtains a prediction shape signal Ekm from the frame 
memory bank 102a, based on the address. Further, the 
shape coding unit 1 20 comprises an arithmetic encoder 5 
1 22 which codes the shape signal Sk from the switch 
101a, block by block, by an arithmetic coding process 
using the prediction shape signal Ekm, thereby generat- 
ing a coded shape bit stream Ek corresponding to each 
block; and an arithmetic decoder 124 which decodes jo 
the coded shape bit stream Ek by an arithmetic decod- 
ing process using the prediction shape signal Ekm. The 
output Ekd from the arithmetic decoder 124 is stored in 
the frame memory bank 102a. 

The texture coding unit 130 comprises a texture is 
motion detector 131 which receives the texture signal 
Spt, and generates a texture motion vector Stv by block 
matching or the like, for each block, while referring to a 
texture signal of a frame previous to the present frame 
which is being processed; and a texture motion com- 20 
pensator 132 which transforms the texture motion vec- 
tor Stv to an address of the frame memory bank 102a, 
and obtains a prediction texture signal Epm from the 
frame memory bank 1 02a, based on the address. Fur- 
ther, the texture coding unit 130 comprises a first adder 25 
133 which adds the texture signal Spt and the prediction 
texture signal Epm; a cosine transformer 134 which 
subjects the output Sad1 from the adder 133 to block- 
by-block DCT, thereby generating DCT coefficients 
Edct; and a quantizer 1 35 which subjects the DCT coef- 30 
f icients Edct to block-by-block quantization. 

Furthermore, the texture coding unit 1 30 comprises 
a variable-length encoder 139 which performs the out- 
put Eq from the quantizer 135 to block-by-block varia- 
ble-length coding, thereby generating a coded texture 35 
bit stream Ep corresponding to each block; an inverse 
quantizer 136 which performs the output Eq from the 
quantizer 1 35 to inverse quantization; an inverse cosine 
transformer 137 which subjects the output Eiq from the 
inverse quantizer 136 to inverse DCT; and a second 40 
adder 138 which adds the output Eidct from the inverse 
cosine transformer 137 and the prediction texture signal 
Epm. The output Sad2 from the second adder 138 is 
stored in the frame memory bank 102a as a prediction 
texture signal. 45 

The image coding apparatus 100a further includes 
a multiplexer 150 which multiplexes the shape identifier 
Sdis from the image input unit 110, the shape motion 
vector Skv and the coded shape bit stream Ek from the 
shape coding unit 120, and the texture motion vector so 
Stv and the coded texture bit stream Ept from the tex- 
ture coding unit 130, together with other necessary 
data. The multiplexer 150 is constructed as follows 
When the binary image signal S2 is input, the multi- 
plexer 1 50 multiplexes the shape identifier Sdis and the 55 
coded shape bit stream from the shape coding unit 120, 
together with other necessary data, and outputs the 
coded binary signal (E2) 600 shown in figure 1(b). 
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When the arbitrary shape image signal Sp is input, the 
multiplexer 150 multiplexes the shape identifier Sdis, 
the coded shape bit stream from the shape coding unit 
120, and the coded texture bit stream from the texture 
coding unit 130, together with other necessary data, 
and outputs the coded arbitrary shape signal (Ep) 500 
shown in figure 1(a). 

A description is given of the operation of the image 
coding apparatus 100a. 

Figure 4 is a flowchart showing the coding process 
by the image coding apparatus according to the second 
embodiment of the invention. 

When a digital image input Sin is input to the image 
coding apparatus 100a (Step Sa1), the image input unit 
110 decides whether the digital image input Sin is a 
binary image signal including only a shape signal and 
output from a scanner, or a digital video signal including 
both of a shape signal and a texture signal and output 
from a camera or an image recording/reproduction 
apparatus (VTR) (Step Sa2). 

When the digital image input Sin is a binary image 
signal, the image input unit 110 generates, as a shape 
identifier (SID), a 1-bit identifier signal of which value is 
"1" (Step Sa3) and extracts a binary shape signal S2k 
by thresholding of the digital image input Sin. At this 
time, the switch 101a is controlled by the switch control 
signal SWa so that the shape signal S2k is input to the 
shape coding unit 120. and the shape coding unit 120 
performs arithmetic coding of the shape signal S2k 
(Step Sa4). 

That is, in the shape coding unit 120, the shape 
motion detector 121 generates a shape motion vector 
Skv from the shape signal S2k for each block. To be 
specific, the shape signal of the present frame, which is 
currently processed, is compared with the shape signal 
of the previous frame by block matching, thereby detect- 
ing a motion of image between the present frame and 
the previous frame, and the motion information is pro- 
vided as the motion vector Skv. This motion detection is 
described in more detail in, tor example, USP (United 
States Patent) No. 4897720. When the shape motion 
vector Skv is input to the shape motion compensator 
123, the compensator 123 transforms the shape motion 
vector Skv to an address of the frame memory bank 
102a, and fetches a prediction shape signal Ekm from 
the frame memory bank 102a, according to the address. 
When the prediction shape signal Ekm and the shape 
signal S2k from the image input unit 1 1 0 are input to the 
arithmetic encoder 122, the arithmetic encoder 122 
subjects the shape signal S2k to an arithmetic coding 
process based on the prediction shape signal Ekm, and 
outputs shape data (coded shape bit stream 61B) E2k 
obtained by coding the shape signal S2k. The arithmetic 
coding method is described in more detail in the specifi- 
cation of JBIG standard. At this time, in the arithmetic 
decoder 124, the shape data E2k is reproduced as 
decoded shape data Ekd in an arithmetic decoding 
process using the prediction shape signal Ekm, and the 
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decoded shape data Ekd is stored in the frame memory 
bank 102a. 

Thereafter, the shape identifier Sdis, the shape 
motion vector Skv, and the shape data E2k are multi- 
plexed in the multiplexer 150, and the multiplexed data 
is output from the output terminal 150a toward a data 
transmission unit (Step Sa5). 

In the image coding apparatus 100a, a control unit 
(not shown) decides whether or not the data processed 
through Steps Sa2~Sa5 is the data of the last block in 
the last frame (Step Sa10) and. when the processed 
data is not the data of the last block in the last frame, the 
process by the image coding apparatus 1 00a returns to 
Step Sa2. When it is the data of the last block in the last 
frame, the image coding process is ended. 

On the other hand, when it is decided in Step Sa2 
that the digital image input Sin is an arbitrary shape 
image signal Sp including a shape signal and a texture 
signal, a 1-bit identifier signal Sdis, of which value is "0", 
is generated as a shape identifier (SID) in the image 
input unit 110 (Step Sa6) and, furthermore, a shape sig- 
nal Spk and a texture signal Spt are generated by the 
chromakey process of the digital image input Sin. At this 
time, the switch 101 a is controlled by the switch control 
signal SWa output from the image input unit 1 1 0 so that 
the shape signal Spk is input to the shape coding unit 
120 while the texture signal Spt is input to the texture 
coding unit 130. 

Thereby, in the shape coding unit 120, the shape 
signal Spk of the arbitrary shape image signal Sp is 
subjected to arithmetic coding in the same manner as 
the above-described arithmetic coding of the shape sig- 
nal S2k of the binary image signal S2 (Step Sa7). In the 
texture coding unit 130, the texture signal Spt is sub- 
jected to coding (Step Sa8). 

More specifically, in the texture coding unit 130, the 
texture motion detector 131 extracts a texture motion 
vector Stv from the texture signal Spt, and the texture 
motion compensator 132 obtains a prediction texture 
signal Epm from the frame memory bank 1 02a accord- 
ing to the motion vector Stv. The first adder 133 obtains 
difference data Sad1 between the prediction texture sig- 
nal Epm and the texture signal Spt of the input arbitrary 
shape image signal. This difference data is transformed 
to DCT coefficients Edct in a frequency domain by the 
cosine transformer 134, and the DCT coefficients Edct 
are quantized by the quantizer 135. Further, the quan- 
tized DCT coefficients Eq are subjected to variable- 
length coding by the variable-length encoder 139 and 
sent to the multiplexer 150. To the multiplexer 150, the 
texture motion vector Stv of the corresponding block is 
input as well. 

The quantized DCT coefficients Eq are subjected to 
inverse quantization by the inverse quantizer 136. and 
the output Eiq from the inverse quantizer 136 is sub- 
jected to inverse cosine transformation by the inverse 
cosine transformer 137 and sent to the second adder 
138, as a reproduced texture signal Eidct. In the second 



adder 138, the reproduced texture signal Eidct and the 
prediction texture signal Epm are added, and Ihe sum 
Sad2 is stored in the frame memory bank 102a. 

When the digital image input is an arbitrary shape 

5 image signal, the multiplexer 150 is supplied with the 
shape identifier Sdis from the image input unit 110, the 
shape motion vector Skv and the shape data Epk from 
the shape coding unit 120, the texture motion vector Stv 
and the texture DCT coefficients Ept from the texture 

jo coding unit 130, and other necessary signals. In the 
multiplexer 150, these data are arranged in a prescribed 
order, and the coded arbitrary shape signal (Ep) 500 
shown in figure 1(a) is output from the multiplexer 150 
(Step Sa9). 

rs Furthermore, as described with respect to the 
binary image signal, in the image coding apparatus 
100a, a control unit (not shown) decides whether the 
data processed through Steps Sa6~Sa9 is data of the 
last block in the last frame (Step Sa1 0). When the proc- 

so essed data is not the data of the last block in the last 
frame, a shape signal and a texture signal correspond- 
ing to the next block are subjected to Steps Sa6~Sa9. 
When the processed data (shape signal and texture sig- 
nal) is the data of the last block in the last frame, the 

25 coding process by the image coding apparatus 100a is 
completed. 

The output terminal 150a of the multiplexer 1 50 is 
connected to a transmission unit for transmitting the 
coded data or a storage unit for storing the coded data, 

so although these units are not shown in the figure. 

As described above, according to the second 
embodiment of the invention, the image coding appara- 
tus 100a for coding a digital image input is constructed 
so that it decides whether the digital image input is a 

35 binary image signal or an arbitrary shape image signal, 
and outputs an identifier signal (shape identifier) Sids 
according to frie result of the decision, together with the 
coded image signal. Therefore, in a decoding apparatus 
for decoding a coded signal of a binary image signal or 

io an arbitrary shape image signal, when an input coded 
signal is a coded binary signal including no coded tex- 
ture bit streams, this is detected by the above-described 
shape identifier, and the content of the decoding proc- 
ess can be changed from the process adapted to a 

45 coded arbitrary shape signal including both of coded 
shape bit streams and coded texture bit streams to the 
process adapted to the coded binary signal. 

Therefore, when coding a binary image signal 
including only a shape signal, it is not necessary to add 

so a dummy coded texture bit stream to each coded shape 
bit stream, which dummy stream enables the binary 
shape signal to be decoded by a decoding method cor- 
responding to a coded arbitrary shape signal. As a 
result, it is possible to create a coded image signal hav- 

55 ing a data structure which enables coded signals 
according to different coding methods to be decoded by 
an image decoding apparatus that employs only a sin- 
gle coding method, without increasing the code quantity 
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required for the coding process. 
[Embodiment 3] 

Figure 5 is a block diagram illustrating an image 
decoding apparatus according to a third embodiment of 
the present invention. In figure 5, reference numeral 
100b designates an image decoding apparatus which 
performs decoding of coded data having a data struc- 
ture for image transmission according to the first 
embodiment of the invention. This apparatus 100b is 
constructed as follows. When the apparatus 100b 
receives a coded binary signal (E2) 600 as shown in fig- 
ure 1 (b) as coded data Eo, it outputs a decoded binary 
signal D2 as decoded image data Dsyn. On the other 
hand, when the apparatus 100b receives a coded arbi- 
trary shape signal (Ep) 500 as shown in figure 1(a) as 
coded data Eo, it outputs a decoded arbitrary shape sig- 
nal Dp as decoded image data Dsyn, for each object. 

More specifically, the image decoding apparatus 
100b comprises a data analyzer 160 which analyzes the 
coded data Eo input to the input terminal 160a and out- 
puts a switch control signal SWb according to the anal- 
ysis result; a shape decoding unit (first decoding 
means) 1 70 which subjects the shape data (the coded 
shape bit stream obtained by coding the shape signal) 
to an arithmetic decoding process; a texture decoding 
unit (second decoding means) 1 80 which subjects the 
texture DCT coefficients Ept to a decoding process 
including inverse DCT; a switch 1 01 b which supplies the 
coded signal analyzed by and output from the data ana- 
lyzer 160 to one of the decoding units 170 and 180, 
according to the switch control signal; and a frame 
memory bank 102 which stores a prediction shape sig- 
nal and a prediction texture signal. 

When decoding of a coded shape bit stream corre- 
sponding to one block has ended, the shape decoding 
unit 170 detects the end of the coded shape bit stream 
Ek (E2k or Epk) corresponding to the block, and outputs 
an end detection signal Te corresponding to the block 
toward the data analyzer 160. 

Further, the data analyzer 160 analyzes the input 
coded data to examine the 1 -bit shape identifier that fol- 
lows the 32-bit synchronous signal at the head, which 
shape identifier is one of the shape identifier 502 in the 
coded arbitrary shape signal 500 shown in figure 1 (a) 
and the shape identifier 602 in the coded binary signal 
600 shown in figure 1(b). When the set value of the 
shape identifier (SID) is 0, the data analyzer 160 con- 
trols the switch 101b with the switch control signal SWb 
so that the coded shape bit stream in the input coded 
signal is input to the shape decoding unit 1 70 and the 
coded texture bit stream in the coded signal is input to 
the texture decoding unit 180. When the set value of the 
shape identifier (SID) is 1. the data analyzer 160 con- 
trols the switch 101b with the switch control signal SWb 
so that the input coded data is input to the shape decod- 
ing unit 170. 



To be specific, as shown in figure 7(b), the data 
analyzer 160 comprises a table storage unit 162 which 
contains a decoding reference table; a comparator 161 
which compares the bit stream of the input coded signal 

5 with the bit stream of the decoding reference table, 
thereby identifying the shape identifier, the coded shape 
bit stream, and the coded texture bit stream, which are 
included in the coded data; and a switch control circuit 
163 which controls the switch 1 01b by the switch control 

,o signal SWb, according to the comparison result from the 
comparator 161 and the end detection signal Te. 

The shape decoding unit (first decoding means) 
1 70 comprises a shape motion compensator 172 which 
receives the shape motion vector corresponding to the 

is binary image or an arbitrary shape motion vector corre- 
sponding to the arbitrary shape image, generates an 
address of the prediction shape signal stored in the 
frame memory bank 102b according to the motion vec- 
tor, and obtains the prediction shape signal Emk from 

20 the frame memory bank 102b; and an arithmetic 
decoder 171 which decodes the input shape data Ek 
with reference to the prediction shape signal Emk to 
generate a reproduced shape signal (decoded shape 
signal) Dk. The reproduced shape signal Dk is stored in 

25 the Irame memory bank 102b. When decoding of a 
coded shape bit stream corresponding to one block has 
ended, the arithmetic decoder 171 detects the end of 
the coded shape bit stream Ek (E2k or Epk) corre- 
sponding to the block, and outputs an end detection sig- 

30 nal Te. The motion compensation is described, in more 
detail, in USP (United States Patent) No. 5157742, and 
the arithmetic decoding is described in the specification 
of JBIG 

The texture decoding unit (second decoding 
35 means) 180 comprises a texture motion compensator 
184 which generates, according to the texture motion 
vector, an address of the prediction texture signal, and 
obtains the prediction texture signal Emp from the frame 
memory bank 1 02b using the address; an inverse quan- 
go tizer 181 which receives the quantization scale and the 
DCT coefficients in the coded texture bit stream, and 
subjects the DCT coefficients to inverse quantization; 
an inverse cosine transformer 182 which subjects the 
inversely quantized DCT coefficients Diq to inverse 
45 cosine transformation; and an adder 183 which adds 
the prediction texture signal Emp and the inversely 
cosine-transformed signal Didct, and outputs a repro- 
duced texture signal (decoded texture signal) Dpt. The 
reproduced texture signal Dpt from the adder 183 is 
so stored in the frame memory bank 1 02b. 

The image decoding apparatus 100b further 
includes a synthesizer 190 which synthesizes the repro- 
duced texture signal Opt and the corresponding repro- 
duced shape signal Dk with desired image data, 
55 respectively, and outputs synthesized image data Dsyn 
including the reproduced arbitrary shape signal Dp cor- 
responding to the coded arbitrary shape signal Ep 
toward the display unit 104. Further, the synthesizer 190 
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outputs the reproduced binary signal D2 corresponding 
to the coded binary signal E2. Although the signals out- 
put from the synthesizer 1 90 are input to the display unit 
104, the signals may be input to a printer (not shown). 
Further, in the synthesizer 190, the reproduced binary 
signal D2 may be synthesized with another arbitrary 
shape image signal. 

In this third embodiment, when decoding the arbi- 
trary shape image signal, with respect to pixels wherein 
the reproduced shape signal is zero, pixel values in the 
reproduced texture signal are replaced with pixel values 
of a prescribed image. The prescribed image is an 
image prepared in advance at the receiving end, or an 
image reproduced by another image decoding appara- 
tus. 

A description is given of the operation of the image 
decoding apparatus so constructed. 

Figure 6 is a flowchart of the decoding process by 
the image decoding apparatus 100b according to the 
third embodiment of the invention. When a coded image 
signal having a data structure shown in figure 1 (a) or 
1(b) is input to the image decoding apparatus 100b, the 
data analyzer 1 60 analyzes the 1 -bit shape identifier 
that follows the 32-bit synchronous signal, and decides 
whether the value of the shape identifier (SID) is 1 or not 
(StepSbl). 

When the result of the decision is SID = 1 , the input 
coded image signal Eo is a coded binary signal E2 that 
includes only coded shape bit streams as image infor- 
mation. So, the data analyzer 160 controls the switch 
101b with the switch control signal SWb so that the 
coded shape bit stream corresponding to each block of 
the coded binary signal is always input to the shape 
decoding unit 170. Thereby, in the shape decoding unit 
170, the coded shape bit streams in the coded binary 
signal are subjected to decoding (Step Sb2). 

More specifically, the shape motion vector MV is 
input to the shape motion compensator 172 and the 
arithmetic decoder 171. Then, the shape motion com- 
pensator 172 generates an address of the prediction 
shape signal stored in the frame memory bank 102b 
according to the shape motion vector MV, and obtains 
the prediction shape signal Emk from the frame mem- 
ory bank 102b. When the prediction shape signal Emk 
is input to the arithmetic decoder 171, the arithmetic 
decoder 171 performs arithmetic decoding of the coded 
shape bit stream E2k with reference to the prediction 
shape signal Emk, thereby generating a decoded binary 
signal D2k. The decoded binary signal D2k is output 
toward the frame memory bank 102b and the synthe- 
sizer 190 and, from the synthesizer 190, it is output 
toward the display unit 104 for image display (Step 
Sb6). The decoded binary signal may be displayed after 
being synthesized with another arbitrary shape image 
signal. 

Thereafter, it is decided whether the coded binary 
signal is data of the last block in the last frame (Step 
Sb7). When the coded binary signal is not data of the 



last block in the last frame, a coded binary signal corre- 
sponding to the next block is subjected to Steps Sb1 , 
Sb2, and Sb6. When the coded binary signal is data of 
the last block in the last frame, decoding of the coded 

5 binary signal is ended. 

On the other hand, when the result of the decision 
in Step Sb1 is SID = 0, the input coded image signal 
includes, as image information, coded shape bit 
streams and coded texture bit streams. Therefore, the 

w data analyzer 160 controls the switch 101b with the 
switch control signal SWb so that the coded shape bit 
stream corresponding to each block of the coded arbi- 
trary shape signal is input to the shape decoding unit 
170 while the coded texture bit stream corresponding to 

is each block is input to the texture decoding unit 180. 
Thereby, the coded shape bit stream and the coded tex- 
ture bit stream are separated from each other (Step 
Sb3), and the coded shape bit stream is decoded by the 
shape decoding unit 170 while the coded texture bit 

20 stream is decoded by the texture decoding unit 180 
(Steps Sb4 and Sb5). 

To be specific, the coded shape bit stream Epk is 
decoded by the shape decoder 170, like the coded 
shape bit stream E2k of the binary image signal. When 

25 decoding of the coded shape bit stream Epk corre- 
sponding to one block has ended, the arithmetic 
decoder 171 detects the end of the coded shape bit 
stream Epk corresponding to the block and outputs an 
end detection signal Te toward the data analyzer 160. 

30 Receiving the end detection signal Te, the data analyzer 
160 controls the switch 101b with the switch control sig- 
nal SWb so that the input coded image signal is input to 
the texture decoding unit 180. 

Thereby, the coded texture bit stream Ept is sent 

35 from the data analyzer 1 60 to the texture decoding unit 
180. In the decoding unit 180, the quantization scale 
and the DCT coefficients in the bit stream are input to 
the inverse quantizer 181 , and the DCT coefficients are 
subjected to inverse quantization. Thereafter, the 

ao inversely quantized signals Diq are subjected to inverse 
cosine transformation by the inverse cosine transformer 
182. 

Meanwhile, the texture motion vector MV is input to 
the texture motion compensator 184. The compensator 

45 184 generates an address for obtaining a prediction tex- 
ture signal according to the texture motion vector MV, 
and obtains the prediction texture signal Emp from the 
frame memory bank 1 02b using the address. 

The adder 1 83 performs addition of the output Didct 

so from the inverse cosine transformer 182 and the predic- 
tion texture signal Emp, and outputs the decoded tex- 
ture signal Dpt. The decoded texture signal Dpt is input 
to the frame memory bank 102b and the synthesizer 
190. The synthesizer 190 synthesizes the decoded and 

55 reproduced texture signal Dpt and the corresponding - 
decoded and reproduced shape signal Dk with a 
desired image to generate synthesized image data 
Dsyn. When decoding of the coded texture bit stream 
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has ended, the data analyzer 1 60 detects the end of the 
coded texture bit stream, and controls the switch 101b 
by the switch control signal SWb so that the input coded 
image signal is input to the shape decoding unit 170. 

Further, the reproduced and synthesized data is 
output from the synthesizer 190 toward the display unit 
104 for image display (Step Sb6). 

Thereafter, it is decided whether the coded arbitrary 
shape signal is data of the last block in the last frame 
(Step Sb7). When the coded arbitrary shape signal is 
not data of the last block in the last frame, a coded arbi- 
trary shape signal of the next block is subjected to Steps 
Sb3~Sb7. When the coded arbitrary shape signal is 
data of the last block in the last frame, decoding of the 
coded arbitrary shape signal is ended. 

In this third embodiment, with respect to pixels 
wherein the reproduced shape signal is zero, pixel val- 
ues in the decoded and reproduced texture signal are 
replaced with pixel values of a prescribed image. The 
prescribed image is an image prepared in advance at 
the receiving end, or an image reproduced by another 
image decoding apparatus. 

As described above, according to the third embodi- 
ment of the invention, the image decoding apparatus 
1 00b includes the data analyzer 160 which analyzes the 
input coded data. The data analyzer 160 detects a 
shape identifier which indicates whether the input coded 
image signal is a coded binary signal including only 
coded shape bit streams as image information or a 
coded arbitrary shape signal including both of coded 
shape bit streams and coded texture bit streams and, 
according to the value of the shape identifier, the ana- 
lyzer 160 controls the switch 101b so that the bit stream 
of the input coded signal is input to one of the shape 
decoding unit 170 and the texture decoding unit 180. 
So, even though a dummy coded texture bit stream is 
not added to each coded shape bit stream when creat- 
ing a coded binary signal, it is possible to perform 
decoding of the coded binary signal by a decoding 
method corresponding to a coded arbitrary shape sig- 
nal. Therefore, when both of a binary image signal and 
an arbitrary shape image signal, which have different 
data structures, are coded by a single coding method 
and these coded signals are decoded by a single 
decoding method, it is possible to efficiently perform 
coding, transmission, and decoding of the binary image 
signal with suppressed bit number. 

[Embodiment 4] 

Figure 8 is a block diagram illustrating an image 
decoding apparatus according to a fourth embodiment 
of the present invention. In figure 8, reference numeral 
100c designates an image decoding apparatus accord- 
ing to this fourth embodiment. The image decoding 
apparatus 100c includes a controller 105, in addition to 
the structure of the image decoding apparatus 100b 
according to the third embodiment. The controller 105 



monitors the load on the decoding process of the tex- 
ture decoding unit 180 according to the operating state 
of the synthesizer 190c, and outputs an overload detec- 
tion signal Lov when the load exceeds a prescribed 
5 value. 

Further, the image decoding apparatus 100c 
includes, instead of the data analyzer 160 according to 
the third embodiment, a data analyzer 1 66 which gener- 
ates a switch control signal SWc according to the end 

10 detection signal Te and the overload detection signal 
Lov. The other structure of the data analyzer 166 is 
identical to that of the data analyzer 1 60. Further, the 
image decoding apparatus 100c includes, instead of the 
switch 101baccording to the third embodiment, a switch 

15 101c which has an input terminal Cin, to which a coded 
image signal from the data analyzer 1 66 is input, and 
three output contacts Co1, Co2 and Co3, and selects 
one of the three contacts in response to the switch con- 
trol signal SWc. The output contact Col is connected to 

so the input of the arithmetic decoder 1 71 , the output con- 
tact Co2 is grounded, and trie output contact Co3 is 
connected to the input of the inverse quantizer 181 and 
to the input of the motion compensator 184. 

Furthermore, the controller 105 has a plurality of 

25 functions for managing the whole decoding apparatus. 
One of the functions is as follows. When synthesis of an 
image corresponding to one frame is not completed 
within a time interval for image display of one frame, 
which interval is set in advance, the controller 105 

30 detects that the load Lde on the decoding process in the 
texture decoding unit 180 is heavy and outputs the 
switch control signal SWc. 

Receiving the overload detection signal Lov, the 
data analyzer 1 66 controls the switch 1 01 c by the switch 

35 control signal SWc so that the input terminal Cin is 
switched between the first output contact Co1 and the 
second output contact Co2. More specifically, the switch 
101c is controlled so that the coded texture bit stream 
Ept of the coded arbitrary shape signal Ep is input to the 

40 grounded end while the coded shape bit stream Ekt of 
the coded arbitrary shape signal Ep is input to the arith- 
metic decoder 171. 

Further, in this fourth embodiment, the synthesizer 
190c has the following function in addition to the func- 

45 tion of the synthesizer 190 according to the third 
embodiment. That is, when the decoding process of the 
coded texture signal is heavily loaded as described 
above, the synthesizer 190c synthesizes the decoded 
shape signal corresponding to the present frame being 

so processed with the decoded texture signal of the previ- 
ous frame, instead of the decoded texture signal corre- 
sponding to the present frame. 

When the data analyzer 166 is not supplied with the 
overload detection signal Lov, i.e., when the image 

55 decoding process is not heavily loaded, the data ana- 
lyzer 166 controls the switch 101c by the switch control 
signal SWc so that the input terminal Cin is connected 
to one of the first output contact Co1 and the third output 
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contact Co3. Thereby, the image decoding apparatus 
1 00c according to the fourth embodiment performs an 
image decoding process identical to that of the image 
decoding apparatus 1 00b according to the third embod- 
iment. 5 

A description is given of the operation of the image 
decoding apparatus 100c according to the fourth 
embodiment. 

Since the fundamental operation of the image 
decoding apparatus 100c is identical to that of the io 
image decoding apparatus 100b according to the third 
embodiment, only the operation relating to the controller 
105 will be described hereinafter. 

For example, when the coded arbitrary shape sig- 
nal Ep is being decoded in the texture decoding unit is 
180, if synthesis of the decoded image signal corre- 
sponding the frame being processed is not completed 
by the time when the image of this frame is to be dis- 
played, the controller 105 detects that the image decod- 
ing process is heavily loaded and outputs the overload 20 
detection signal Lov toward the data analy2er 166. 
Then, the switch 1 01c is controlled by the switch control 
signal SWc from the data analyzer 166, whereby the 
coded shape bit stream Epk of the coded arbitrary 
shape signal Ep from the data analyzer 1 66 is supplied 25 
through the first output contact Co1 of the switch 101c 
to the shape decoding unit 1 70 while the coded texture 
bit stream Ept of the coded arbitrary shape signal Ep is 
supplied through the second output contact Co2 of the 
switch 101c toward the ground. 30 

Further, instead of the decoded texture signal cor- 
responding to the present frame being processed, the 
decoded texture signal of the previous frame is synthe- 
sized with the decoded shape signal corresponding to 
the present frame. 35 

As described above, according to the fourth 
embodiment of the invention, the load of the arithmetic 
processor in the image decoding apparatus is moni- 
tored and, when decoding of the arbitrary shape image 
signal corresponding to each frame is late for the timing 40 
to display the image corresponding to each frame, only 
the coded shape bit stream in the coded arbitrary shape 
signal is reproduced, and image display is performed 
using a coded texture bit stream which has previously 
been reproduced. Therefore, even when the decoding 45 
process is heavily loaded, it is possible to realize image 
display in which the displayed image is not discontinued 
and the image flows smoothly. 

In this fourth embodiment, as a method of detecting 
the load on the decoding process, when the decoded so 
image signal corresponding to each frame is late for the 
timing to display the image corresponding to the frame, 
it is decided that the decoding process is heavily 
loaded. However, the load detecting method is not 
restricted thereto. 55 

Further, in this fourth embodiment, when the 
decoding process is over-loaded, only the coded shape 
bit stream in the coded arbitrary shape signal is 



decoded and the coded texture bit stream in the coded 
arbitrary shape signal is not decoded. However, only the 
coded texture bit stream may be decoded without 
decoding the coded shape bit stream. 

In this case, instead of the decoded shape signal 
corresponding to the present frame being processed, 
the decoded shape signal corresponding to the previ- 
ous frame is synthesized with the decoded texture sig- 
nal corresponding to the present frame. 

[Embodiment 5] 

Figure 9 is a block diagram illustrating an image 
decoding apparatus according to a fifth embodiment of 
the present invention. In figure 9, reference numeral 
100d designates an image decoding apparatus accord- 
ing to this fifth embodiment. The image decoding appa- 
ratus 100d includes an input terminal 106, such as a 
mouse or a remote controller, for inputting a manual 
control signal Min from the outside to the image decod- 
ing apparatus 100d, instead of the controller 105 for 
detecting the load on the decoding process, included in 
the image decoding apparatus 100c according to the 
fourth embodiment. The manual control signal Min from 
the input terminal 106 is input to the data analyzer 165, 
instead of the overload detection signal Lov. Other parts 
are identical to those described for the fourth embodi- 
ment. 

Receiving the manual control signal Min, Ihe data 
analyzer 1 65 controls the switch 1 0 1 c by the switch con- 
trol signal SWc so that the input terminal Cin is con- 
nected to one of the first output contact Co1 and the 
second output contact Co2. More specifically, the switch 
101c is controlled so that the coded texture bit stream 
Ept of the coded arbitrary shape signal Ep is input to the 
grounded end while the coded shape bit stream Ekt of 
the coded arbitrary shape signal Ep is input to the arith- 
metic decoder 171. 

Also in this fifth embodiment, as in the fourth 
embodiment, when the manual control signal Min is 
being output toward the data analyzer 165, instead of 
the decoded texture signal corresponding to the present 
frame being processed, the decoded texture signal cor- 
responding to the previous frame is synthesized with the 
decoded shape signal corresponding to the present 
frame. 

When the data analyzer 165 does not receive the 
manual control signal Min, it controls the switch 101 c by 
the switch control signal SWc so that the input terminal 
Cin is connected to one of the first output contact Co1 
and the third output contact Co3. Thereby, the image 
decoding apparatus 100d according to the fifth embodi- 
ment performs an image decoding process identical to 
that of the image decoding apparatus 1 00b according to 
the third embodiment. 

As described above, the image decoding apparatus 
100d according to the fifth embodiment is provided with 
the input terminal 106, such as a mouse or a remote 
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controller, for inputting the manual control signal Min 
from the outside, and the control of the switch 101c by 
the data analyzer 165 is changed according to the man- 
ual control signal Min from the input terminal 106. 
Therefore, until the user obtains a reproduced image by 
the user's control, only the coded shape bit stream can 
be decoded without decoding the coded texture bit 
stream, and this enables the user to quickly detect the 
head of the object image when reproducing a coded sig- 
nal recorded in a record medium. 

In the second to fifth embodiments of the invention, 
the coding process or the decoding process is applied 
to the coded arbitrary shape signal 500 (figure 1(a)) and 
the coded binary signal 600 (figure 1(b)) which corre- 
spond to the inter-frame prediction coding method. 
However, the fundamental principle of the present 
invention, i.e., to add an image identifier for identifying 
image signals having different data structures, such as 
an arbitrary shape image signal and a binary image sig- 
nal when coding these image signals for transmission, 
and to detect the image identifier when decoding these 
coded signals so as to make the decoding process cor- 
respond to the coded signals, such as a coded arbitrary 
shape signal and a coded binary signal, can be applied 
to the coded arbitrary shape signal 500b having the 
data structure shown in figure 2(a) and the coded binary 
signal 600b having the data structure shown in figure 
2(b). 

Further, in the first to fifth embodiments of the 
invention, as image identifiers for identifying coded 
image signals having different data structures, 
employed are shape identifiers (SID = 0 or 1) each com- 
prising a 1-bit code for identifying a coded arbitrary 
shape signal or a coded binary signal. However, image 
identifiers comprising 2-bit codes (SID = 00, 01, 10. or 
11) or 3-bit codes may be employed. In this case, it is 
possible to identify more coded image signals by the 
image identifiers. 

When a plurality of coded image signals are identi- 
fied with the image identifiers, for a prescribed coded 
image signal, an image identifier, of which bit number is 
different from those ol image identifiers of other coded 
image signals, may be employed. For example, a coded 
image signal including an image identifier comprising a 
2-bit code and a coded image signal including an image 
identifier comprising a 3-bit code may be treated by a 
system corresponding to a single coding method. In this 
case, code allocation to each image identifier can be 
carried out with high efficiency. 

Hereinafter, a description is given of a data struc- 
ture for image transmission and an image decoding 
method, which employ image identifiers comprising 
codes of two or more bits. 

[Embodiment 6] 

Figures 10(a)-10(c) are diagrams for explaining 
data structures far image transmission according to a 



sixth embodiment of the present invention. Figure 10(a) 
shows a data structure of a coded arbitrary shape signal 
obtained by coding an arbitrary shape image signal cor- 
responding to each of plural objects constituting a dis- 

5 play image. Figure 10(b) shows a data structure of a 
coded binary signal obtained by coding a binary image 
signal. Figure 10(c) shows a data structure of a coded 
rectangle signal (coded pixel value signal) obtained by 
coding a rectangle image signal including, as display 

w data, only a texture signal (pixel value signal) for color 
display of an image. 

These data structures for image transmission 
according to the sixth embodiment are applied to a sys- 
tem in which a rectangle image signal including, as dis- 

15 play data, only a texture signal (i.e., luminance signal 
and color difference signal) is compressively coded and 
transmitted and, further, the coded rectangle image sig- 
nal is decoded and displayed. That is, in this system, the 
image coding method and apparatus are constructed to 

so perform a coding process for the rectangle image signal 
while the image decoding method and apparatus are 
constructed to perform a decoding process for the 
coded rectangle image signal (coded pixel value signal). 
In figure 10(a), reference numeral 1500 designates 

25 a coded arbitrary shape signal obtained by coding an 
arbitrary shape image signal corresponding to each of 
plural objects constituting an image of one display 
image. This coded arbitrary shape signal 1500 is identi- 
cal to the coded arbitrary shape signal 500 according to 

30 the first embodiment except that a two-bit image identi- 
fier 1502 is used instead of the 1-bit shape identifier 
502. 

In figure 10(b), reference numeral 1600 designates 
a coded binary signal obtained by coding a binary 

35 image signal that represents image information on one 
display image with a binary signal. The coded binary 
signal 1600 is identical to the coded binary shape signal 
600 according to the first embodiment except that a two- 
bit image identifier 1602 is employed instead of the 1 -bit 

40 shape identifier 602. 

In figure 10(c), reference numeral 1700 designates 
a coded pixel value signal which is a coded rectangle 
signal obtained by coding a rectangle signal that 
includes information about the vertical and horizontal 

45 size of one display image (one frame). This coded pixel 
value signal (coded rectangle signal) 1700 is identical to 
the coded pixel value signal 700a shown in figure 23(c) 
except that a two-bit image identifier 1702 is inserted 
between the synchronous signal 701 and the header 

50 703. 

The 2-bit image identifiers (SID) 1502, 1602, and 
1702 have values of 01, 10. and 00, respectively, and 
the coded signals 1500, 1600, and 1700 are identified 
by the values of these image identifiers. 
55 A description is given of the function and effect. 

In the sixth embodiment so constructed, the coded 
pixel value signal 1700 corresponding to the rectangle 
image signal includes coded texture bit streams 71 C 
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obtained by coding a texture signal, and a two-bit image 
identifier 1702 indicating that the signal 1700 includes 
only the coded texture bit streams as display data. So. 
by referring to the image identifier 1702, among the 
three coded signals, i.e., the coded arbitrary shape sig- 
nal 1500, the coded binary signal 1600, and the coded 
pixel value signal 1700, the coded pixel value signal 
1700 can be distinguished from the other signals 1500 
and 1600. 

To be specific, in an image decoding apparatus 
adapted to MPEG2 and constructed to decode the 
coded pixel value signal 1700, even when the coded 
arbitrary shape signal 1 500 or the coded binary signal 
1 600 is input, these coded signals can be recognized as 
signals different from the coded pixel value signal 1 700 
by the image identifier. Therefore, it is possible to avoid 
that the coded arbitrary shape signal 1500 or the coded 
binary signal 1600 is subjected to the decoding process 
adapted to MPEG2 and the decoding operation comes 
to a rupture. Further, when these coded signals are 
input, the fact that these signals are not decodable can 
be displayed on the display image. 

In this sixth embodiment of the invention, the coded 
arbitrary shape signal 1500. the coded binary signal 
1600, and the coded rectangle signal (coded pixel value 
signal) 1700 have the data structures obtained by inter- 
frame prediction coding of an arbitrary shape image sig- 
nal, a binary image signal, and a rectangle image sig- 
nal, respectively. However, these coded signals may 
have data structures obtained by intra-frame coding of 
the respective image signals. 

Furthermore, in the respective coded signals 
employed in this sixth embodiment, both or either of the 
coded shape bit stream and the coded texture bit 
stream are/is arranged for each block. However, in each 
coded signal, prescribed coded bit streams may be 
arranged for each frame. 

[Embodiment 7] 

Figure 1 1 is a block diagram illustrating an image 
decoding apparatus according to a seventh embodi- 
ment of the invention. In figure 1 1 , reference numeral 
100e designates an image decoding apparatus for 
decoding coded data having the data structures for 
image transmission according to the sixth embodiment 
of the invention. This image decoding apparatus 1 0Oe is 
constructed as follows. When the apparatus 100e 
receives, as coded data Eo, the coded arbitrary shape 
signal (Ep) 1500 shown in figure 10(a) and the coded 
binary signal (E2) 1600 shown in figure 10(b), it does 
not decode these coded signals but displays that these 
coded signals currently being input are not decodable. 
On the other hand, when the apparatus 1 0Oe receives, 
as coded data Eo, the coded rectangle signal (Et) 1700 
shown in figure 10(c), it outputs a decoded rectangle 
signal Dt as decoded image data. 

More specifically, the image decoding apparatus 



100e comprises a data analyzer 160e which analyzes 
the coded data Eo input to the input terminal 160a and 
outputs a switch control signal SWe according to the 
analysis result; a texture decoding unit 180e which sub- 

5 jects the coded rectangle signal (coded pixel value sig- 
nal) Et to a decoding process including inverse DCT; a 
switch 101e which supplies the coded signal analyzed 
by and output from the data analyzer 160e to one of the 
decoding unit 180e and the grounded end; and a frame 

w memory bank I02e which stores a prediction texture 
signal. 

The data analyzer 1 60e analyzes the input coded 
data to examine the two-bit image identifier that follows 
the 32-bit synchronous signal at the head, which image 

75 identifier is one of the image identifier (SID = 01) 1502 
in the coded arbitrary shape signal 1 500 shown in figure 
10(a). the image identifier (SID = 10) 1602 in the coded 
binary signal shown in figure 10(b). and the image iden- 
tifier (SID = 00) 1702 in the coded rectangle signal 

so shown in figure 10(c). When the set value of the image 
identifier (SID) is 01 or 10, the data analyzer 160e con- 
trols the switch 101e with the switch control signal SWe 
so that the input coded signal is applied to the grounded 
end. When the set value of the image identifier (SID) is 

25 00, the data analyzer 1 60e controls the switch 1 0 1 e with 
the switch control signal SWe so that the input coded 
data is input to the decoding unit 180e. 

Further, the structure of the data analyzer 160e is 
identical to that of the data analyzer 160 according to 

30 the third embodiment shown in figure 7(b), and the 
structure of the texture decoding unit 1 80e is identical to 
that of the texture decoding unit 180 according to the 
third embodiment. 

in the image decoding apparatus 100e, the 

3s decoded (reproduced) texture signal Dt, i.e., the output 
from the decoding unit I80e, is directly input to the 
frame memory bank 102e and to the display unit 104. 
However, a synthesizer identical to the synthesizer 190 
employed for the third embodiment (refer to figure 5) 

40 may be disposed before the display unit 104 so that the 
decoded texture signal Dt is synthesized with another 
decoded arbitrary shape signal. 

A description is given of the operation of the image 
decoding apparatus 100e. 

45 Figure 1 2 is a flowchart of the decoding process by 
the image decoding apparatus 100e according to the 
seventh embodiment of the invention. When a coded 
image signal having a data structure shown in any of fig- 
ures 10(a), 10(b), and 10(c) is input to the image decod- 

50 ing apparatus lOOe, the data analyzer 160e analyzes 
the 2-bit image identifier that follows the 32-bit synchro- 
nous signal, and decides whether the value of the 
image identifier (SID) is 00 or not (Step Se1). 

When the result of the decision is not SID = 00 but 

55 SID = 01 or SID = 10. since the input coded image sig- 
nal Eo is a coded arbitrary shape signal Ep or a coded 
binary signal E2, the data analyzer 160e controls the 
switch 101 e with the switch control signal SWe so that 



23 



45 EP 0 891 093 A2 46 



the coded signal is supplied to the grounded end 
Thereby, the coded arbitrary shape signal Ep or the 
coded binary signal E2 is discarded (Step Se3). There- 
fore, the coded image signal E2 or Ep including coded 
shape bit streams is never processed by the decoding 
unit 180e. 

While the coded arbitrary shape signal Ep or the 
coded binary signal E2 is being input to the apparatus 
100e, the apparatus 100e displays a message that the 
coded signal being currently input cannot be processed 
by this decoding apparatus lOOe (Step Se4). 

Thereafter, it is decided whether the input coded 
signal is data of the last block of the last frame (Step 
Se6). When the input coded signal is not data of the last 
block of the last frame, an input coded signal corre- 
sponding to the next block is subjected to Steps Sel, 
Se3, Se4, and Se6. Conversely, when the input signal is 
data of the last block of the last frame, decoding of the 
input signal is ended. 

On the other hand, when the result of the decision 
in Step Se1 is SID = 00, since the input coded data Eo 
is a coded rectangle signal Et, the data analyzer I60e 
controls the switch 101e with the switch control signal 
SWe so that the coded signal is input to the decoding 
unit 180e. Thereby, the texture decoding unit 180e 
decodes the coded texture bit stream included in the 
coded rectangle signal Et (Step Se2). 

To be specific, in the decoding unit 180e, the quan- 
tization scale and the DCT coefficients included in the 
bit stream are input to the inverse quantizer 181, and 
the DCT coefficients are subjected to inverse quantiza- 
tion. Thereafter, the inversely quantized signal Diq is 
subjected to inverse cosine transformation by the 
inverse cosine transformer 182. 

Meanwhile, the texture motion vector MV is input to 
the texture motion compensator 1 84. The compensator 
1 84 generates an address for obtaining a prediction tex- 
ture signal according to the texture motion vector MV, 
and obtains the prediction texture signal Emp from the 
frame memory bank 102e using the address. 

The adder 1 83 performs addition of the output Didct 
from the inverse cosine transformer 182 and the predic- 
tion texture signal Emp, and outputs the decoded tex- 
ture signal Dpt (decoded rectangle signal) toward the 
display unit 104 (Step Se5). Simultaneously, the 
decoded texture signal Dt is input to the frame memory 
bank 102e. A synthesizer may be disposed after the 
decoding unit to synthesize the decoded rectangle sig- 
nal with another arbitrary shape image signal, for dis- 
play. 

Thereafter, it is decided whether the coded rectan- 
gle signal Et is data of the last block of the last frame 
(Step Se6). When the signal Et is not data of the last 
block of the last frame, a coded rectangle signal of the 
next block is subjected to Steps Se1, Se2, Se5, and 
Se6. When the signal Et is data of the last block of the 
last frame, decoding of the coded rectangle signal is 
ended. 



As described above, according to the sixth embod- 
iment of the invention, the image decoding apparatus 
100e includes the data analyzer 160e which analyzes 
the input coded data. The data analyzer 1 60e detects, 

5 with the two-bit image identifier, whether the coded 
image signal input to the apparatus is a coded signal 
including coded shape bit streams as image information 
(i.e., the coded binary signal E2 or the coded arbitrary 
shape signal Ep) or a coded rectangle signal Et includ- 

10 ing no coded shape bit streams and, according to the 
result of the decision, the analyzer I60e controls the 
switch lOle so that the input coded signal is input to 
one of the grounded end and the decoding unit 1 80e. 
Therefore, even when the coded arbitrary shape signal 

15 Ep or the coded binary signal E2 is input to the image 
decoding apparatus 1 0Oe adapted to the coded rectan- 
gle signal Et, these coded signals are not decoded, 
whereby the decoding operation is prevented from com- 
ing to a rupture 

20 In this seventh embodiment of the invention, 
emphasis has been placed on the image decoding 
apparatus 1 0Oe which selects, according to an image 
identifier, a coded signal including coded shape bit 
streams from the coded signals having the data struc- 

2s tures according to the sixth embodiment, i.e. , the coded 
arbitrary shape signal 1500, the coded binary signal 
1600, and the coded rectangle signal 1700. However, 
when the image input unit 110 in the image coding 
apparatus 1 00a according to the second embodiment 

3D (refer to figure 3) is constructed so that it can identify an 
arbitrary shape image signal, a binary image signal, and 
a rectangle image signal, it is possible to implement an 
image coding apparatus that performs coding of these 
image signals with image identifiers for identifying the 

35 coded signals corresponding to these image signals. 

[Embodiment 8] 

Figures 13(a) and 13(b) and figures 14(a) and 14(b) 

40 are diagrams for explaining data structures for image 
transmission according to an eighth embodiment of the 
present invenlion. More specifically, figure 13(a) shows 
a data structure of a coded arbitrary shape signal (first 
coded arbitrary shape signal) 2500 obtained by coding 

is an arbitrary shape image signal including both of coded 
shape bit streams and coded texture bit streams, and 
figure 1 3(b) shows a data structure of a coded binary 
signal 2600 obtained by coding a binary image signal. 
Figure 14(b) shows a data structure of a coded rectan- 

50 gle signal (coded pixel value signal) 2700 obtained by 
coding a rectangle image signal, and figure 14(b) shows 
a data structure of a coded arbitrary shape signal with 
transparency information (second coded arbitrary 
shape signal) 2800 obtained by coding an arbitrary 

55 shape image signal including transparency information. 
These data structures for image transmission 
according to the eighth embodiment are applied to a 
system in which the arbitrary shape image signal includ- 
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ing transparency information is compressively coded 
and transmitted and, further, the second coded arbitrary 
shape signal obtained by the compressive coding is 
decoded and displayed. That is, in this system, the 
image coding method and apparatus are constructed to 
perform a coding process for the arbitrary shape image 
signal including transparency information while the 
image decoding method and apparatus are constructed 
to perform a decoding process for the second coded 
arbitrary shape signal. 

The first coded arbitrary shape signal 2500 shown 
in figure 13(a) is identical to the coded arbitrary shape 
signal 1 500 according to the sixth embodiment shown in 
figure 10(a), that is, it is obtained by replacing the 1-bit 
shape identifier 502 in the coded arbitrary shape signal 
500 according to the first embodiment with a two-bit 
image identifier 2502. 

The coded binary shape signal 2600 shown in fig- 
ure 13(b) is identical to the coded binary shape signal 
1600 according to the sixth embodiment shown in figure 
10(b), that is, it is obtained by replacing the 1-bit shape 
identffier 602 in the coded binary shape signal 600 
according to the first embodiment with a two-bit image 
idenMier 2602. 

The coded pixel value signal 2700 shown in figure 
14(a) is identical to the coded pixel value signal 1700 
according to the sixth embodiment shown in figure 
10(c), that is, it is obtained by inserting a two-bit image 
identifier 2702 between the synchronous signal 701 and 
the header 703 in the coded pixel value signal 700a 
shown in figure 23(c). 

The second coded arbitrary shape signal 2800 
shown in figure 1 4(b) includes a 32-bit synchronous sig- 
nal 801 at the beginning, a two-bit image identifier (SID) 
2802 that follows the synchronous signal 801, and an 
additional header 803 that follows the image identifier 
2802. 

Further, the second coded arbitrary shape signal 
2800 includes coded shape bit streams 81 D obtained 
by coding a shape signal (binary transparency signal) 
that represents the shape of each object as a constitu- 
ent of the arbitrary shape image signal with transpar- 
ency information; coded texture bit streams (coded pixel 
value streams) 82D obtained by coding a texture signal 
(pixel value signal) that comprises a luminance signal 
and a color difference signal for color display of each 
object as a constituent of the arbitrary shape image sig- 
nal with transparency information; and coded transpar- 
ency bit streams 83D obtained by coding a multivalued 
transparency signal that represents the transparency of 
each object as a constituent of the arbitrary shape 
image signal with transparency information. 

To be specific, in the coded arbitrary shape signal 
2800, for each of plural blocks into which an object 
region including an object on a display image is divided, 
a coded shape bit stream 81 D, a coded texture bit 
stream 82D, and a coded transparency bit stream 83D 
are arranged in this order. 



More specifically, following the header 803, 
arranged are a coded shape bit stream 81 D1 , a coded 
texture bit stream 82D1 , and a coded transparency bit 
stream 83D1 corresponding to a block D1; a coded 

5 shape bit stream 81 D2, a coded texture bit stream 
82D2, and a coded transparency bit stream 83D2 corre- 
sponding to a block D2; and a coded shape bit stream 
81 D3, a coded texture bit stream 82D3, and a coded 
transparency bit stream 83D3 corresponding to a block 

10 D3. 

Further, as shown in figure 14(b), the coded shape 
bit streams 81 D1, 81 D2, and 81 D3 comprise variable- 
length coded data 804, 812, and 820 corresponding to 
shape motion vectors (in the figure, referred to simply as 

15 shape MV); and variable-length coded data 805, 813, 
and 821 corresponding to binary shape signals (binary 
transparency signals), each signal indicating whether 
pixels within the block are located inside the object or 
outside the object (in the figure, shape data), respec- 

20 tjvely. 

The coded texture bit streams 82D1, 82D2, and 
82D3 comprise 5-bit coded data 806, 814, and 822 cor- 
responding to quantization scales (in the figure, quanti- 
zation scale); variable-length coded data 807, 815, and 

25 823 corresponding to texture motion vectors (in the fig- 
ure, texture MV); and variable-length coded data 808, 
816, and 824 corresponding to quantized signals 
obtained by subjecting the texture signal to DCT and 
quantization (in the figure, texture DCT coefficients), 

30 respectively. 

The coded transparency bit streams 83D1, 83D2, 
and 83D3 comprise 5-bit coded data 809, 817, and 825 
corresponding to quantization scales (in the figure, 
quantization scale); variable-length coded data 810, 

35 818, and 826 corresponding to transparency motion 
vectors (in the figure, transparency MV); and variable- 
length coded data 811, 819, and 827 corresponding to 
quantized signals obtained by subjecting the multival- 
ued transparency signal to DCT and quantization (in the 

40 figure, transparency DCT coefficients), respectively. 

The synchronous signal 801 is a signal indicating 
the head of a coded arbitrary shape signal correspond- 
ing to one object, and this is a unique coded signal. The 
shape identifier (SID) 2802 is a signal indicating which 

45 type is the coded image signal among the above- 
described four types of coded image signals, i.e., it is a 
signal for identifying the kind of the bit streams included 
in the coded image signal. When the value of the image 
identifier (SID) is 01, it indicates that both of the coded 

so shape bit streams and the coded texture bit streams 
exist in the coded image signal. When SID = 10, the 
image identifier indicates that only the coded shape bit 
streams exist in the coded signal. When SID = 00, the 
image identifier indicates that only the coded texture bit 

55 streams exist in the coded signal. When SID is 1 1, the 
image identifier indicates that the coded shape bit 
streams, the coded texture bit streams, and the coded 
transparency bit streams exist in the coded signal. 
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The header 803 includes information relating to, for 
example, the time to display the image of the corre- 
sponding object, the attribute of the image, and the pre- 
diction mode for coding. However, these data have no 
relationship with the present invention and, therefore, 5 
do not require detailed description. 

The shape MV is the coded data of a motion vector 
which shows a motion of an image within a block 
between the present frame and the previous frame, and 
this is used for prediction of a shape signal of a specific 10 
block in the present frame from a shape signal of the 
corresponding block in the previous frame. Further, the 
shape data is obtained by subjecting a shape signal to 
arithmetic coding, and the quantization scale (in the 
coded texture bit stream) is a parameter for inversely 15 
quantizing DCT coefficients which are obtained by sub- 
jecting a texture signal to DCT and quantization. The 
texture MV is a coded signal of a texture motion vector 
which shows a motion of an image within a block 
between the present frame and the previous frame, and so 
this is used for prediction of a texture signal of the 
present frame from a texture signal of the previous 
frame. Further, the texture DCT coefficients are 
obtained by subjecting a quantized signal of the texture 
signal to variable-length coding. Further, the transpar- 25 
ency MV is a coded signal of a transparency motion 
vector which shows a motion of an image within a block 
between the present frame and the previous frame, and 
this is used for prediction of a multivalued transparency 
signal of the present frame from a multivalued transpar- 30 
ency signal of the previous frame. Furthermore, the 
quantization scale (in the coded transparency bit 
stream) is a parameter for inversely quantizing transpar- 
ency DCT coefficients which are obtained by subjecting 
a multivalued transparency signal to DCT and quantiza- 35 
tion. The transparency DCT coefficients are obtained by 
subjecting a quantized signal of the transparency signal 
to variable-length coding. 

In the coded arbitrary shape signal 2800, actually, 
various kinds of side information (data) are arranged 40 
before the texture DCT coefficients although these data 
are not shown in the figure, and these side data some- 
times require a large number of bits. 

A description is now given of the function and effect. 

I n the eighth embodiment so constructed , since the 45 
four coded image signals having different data struc- 
tures, i.e., the first coded arbitrary shape signal 2500 
corresponding to an arbitrary shape image signal, the 
coded binary signal 2600 corresponding to a binary 
image signal, the coded pixel value signal (coded rec- so 
tangle signal) 2700 corresponding to a rectangle image 
signal, and the second coded arbitrary shape signal 
2800 corresponding to an arbitrary shape image signal 
with transparency information, include the two-bit image 
identifiers 2502, 2602, 2702, and 2802 for identifying 55 
these coded signals, respectively. Therefore, those four 
coded signals can be identified at the decoding end by 
referring to the image identifiers. 



To be specific, in an image decoding apparatus 
adapted to MPEG4 and constructed to decode the 
coded arbitrary shape signal mentioned above, even 
when the coded binary signal and the coded rectangle 
signal are input besides the first and second coded arbi- 
trary shape signals, the image identifiers enable the 
apparatus to appropriately decode the coded binary sig- 
nal and the coded rectangle signal. Therefore, it is pos- 
sible to avoid that the coded binary signal and the coded 
rectangle signal are subjected to the decoding process 
adapted to MPEG4 and the decoding operation comes 
to a rupture. 

In this eighth embodiment of the invention, the first 
and second coded arbitrary shape signals, the coded 
binary signal, and the coded rectangle signal have the 
data structures obtained by inter-frame prediction cod- 
ing of an arbitrary shape image signal, an arbitrary 
shape image signal with transparency information, a 
binary image signal, and a rectangle image signal, 
respectively. However, these coded signals may have 
data structures obtained by intra-frame coding of the 
respective image signals. 

Furthermore, in the respective coded signals 
employed in this eighth embodiment, at least one of the 
coded shape bit stream, the coded texture bit stream, 
and the coded transparency bit stream is arranged for 
each block. However, in each coded signal, prescribed 
coded bit streams may be arranged for each frame 
(refer to figures 2(a) and 2(b)). 

Furthermore, in the data structure of the second 
coded arbitrary shape signal 2800 (coded signal of the 
arbitrary shape image signal with transparency informa- 
tion) shown in figure 14(b), each of the coded transpar- 
ency bit streams 83D includes a transparency MV and a 
quantization scale, like the coded texture bit streams 
82D. However, it is not always necessary for the coded 
transparency bit stream 83D in the second coded arbi- 
trary shape signal 2800 to have the transparency MV 
and the quantization scale. 

When the coded signal of the arbitrary shape image 
signal with transparency information has the data struc- 
ture in which the coded transparency bit streams 83D 
do not include transparency MVs and quantization 
scales, decoding of the transparency DCT coefficients 
is carried out using the texture MVs and the quantiza- 
tion scales of the coded texture bit streams 82D. 

[Embodiment 9] 

Figure 15 is a block diagram illustrating an image 
decoding apparatus according to a ninth embodiment of 
the present invention. In figure 15, reference numeral 
100f designates an image decoding apparatus which 
performs decoding of coded data having the data struc- 
ture for image transmission according to the eighth 
embodiment. The image decoding apparatus lOOf is 
constructed to perform a decoding process according to 
the data structure of a coded signal input as coded data. 
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More specifically, the apparatus lOOf is constructed 
as follows. When the apparatus 100f receives the coded 
binary signal (E2) 2600 shown in figure 13(b) as coded 
data Eo, it outputs a decoded binary signal D2 as 
decoded image data Dsyn. When the apparatus 100f 
receives the first coded arbitrary shape signal (Ep) 2500 
shown in figure 13(a) as coded data Eo, it outputs a 
decoded arbitrary shape signal Dp as decoded image 
data Dsyn, tor each object. When the apparatus 100f 
receives the coded rectangle signal (Et) 2700 shown in 
figure 14(a) as coded data Eo, it outputs a decoded rec- 
tangle signal Dt as decoded image data Dsyn. When 
the apparatus 1 0Of receives the second coded arbitrary 
shape signal (Ex) 2800 shown in figure 1 4(b) as coded 
data Eo, it outputs a decoded arbitrary shape signal Dx 
with transparency information as decoded image data 
Dsyn, for each object. 

Hereinafter, the structure of the image decoding 
apparatus 100f will be described in detail. 

The apparatus lOOf comprises a data analyzer I60f 
which analyzes the coded data Eo input to the input ter- 
minal 160a and outputs a switch control signal SWf 
according to the analysis result; a shape decoding unit 
(first decoding means) 1 70 which subjects the shape 
data (coded shape signal) to an arithmetic decoding 
process; a texture decoding unit (second decoding 
means) 180f which subjects the texture DCT coeffi- 
cients Ept and the transparency DCT coefficients Egt to 
a decoding process including inverse DCT; a switch 
1 01 f which supplies the coded signal analyzed by and 
output from the data analyzer 160f to one of the decod- 
ing units 170 and 180f, according to the switch control 
signal SWf; and a frame memory bank 102f which 
stores a prediction shape signal, a prediction texture 
signal, and a prediction transparency signal. 

The data analyzer 1601 analyzes the input coded 
signal to examine the 2-bit image identifier that follows 
the 32-bit synchronous signal at the head (any of the 
image identifiers 2502, 2602, 2702, and 2802 in the 
coded signals shown in figures 13(a), 13(b), 14(a), and 
14(b), respectively), and controls the switch 101f 
according to the set value of the image identifier (SID). 

When SID = 01 , the switch 101f is controlled by the 
switch control signal SWf so that the coded shape bit 
streams of the input coded signal are input to the shape 
decoding unit 170 and the coded texture bit streams of 
the coded signal are input to the texture decoding unit 
180f. When SID = 10, the switch 101 f is controlled by 
the switch control signal SWf so that the input coded 
data is input to the shape decoding unit 170. When SID 
= 00, the switch 101 f is controlled by the switch control 
signal SWf so that the coded texture bit streams of the 
input coded signal are input to the texture decoding 
means 180f. When SID = 1 1, the switch 101 f is control- 
led by the switch control signal SWf so that the coded 
shape bit streams of the input coded signal are input to 
the shape decoding unit 170, and the coded texture bit 
streams and the coded transparency bit streams are 



input to the texture decoding unit 1 801 

Like the data analyzer 160 according to the third 
embodiment shown in figure 7(b), the data analyzer 
160f comprises a table storage unit 162, a comparator 

5 161, and a switch control circuit 163. In this ninth 
embodiment, the comparator 161 compares the bit 
stream of the input coded signal with the bit stream of 
the decoding reference table stored in the table storage 
unit 162, and identifies the image identifier, coded 

jo shape bit stream, coded texture bit stream, and coded 
transparency bit stream, which are included in the 
coded data. According to the result of the identification, 
the comparator 161 outputs a switch control signal SWf 
toward the switch 101 f. 

15 The texture decoding unit (second decoding 
means) 180f includes a texture motion compensator 
184f which generates, according to the texture motion 
vector or the transparency motion vector, an address for 
obtaining a prediction texture signal or a prediction 

20 transparency signal, and obtains a prediction texture 
signal Emp or a prediction transparency signal Emg 
from the frame memory bank 102b using the address. 
Further, the texture decoding unit 180f includes an 
inverse quantizer 181f which receives the quantization 

25 scale and the DCT coefficients in the coded texture bit 
stream or the coded transparency bit stream, and sub- 
jects the DCT coefficients to inverse quantization; and 
an inverse cosine transformer 182f which subjects the 
inversely quantized DCT coefficients Diq to inverse 

30 cosine transformation. Furthermore, the texture decod- 
ing unit I80f includes an adder 183 which adds the pre- 
diction texture signal Emp or the prediction 
transparency signal Emg to the inversely cosine-trans- 
formed signal Didct, and outputs a reproduced texture 

35 signal (decoded texture signal) Dpt or a reproduced 
transparency signal (decoded transparency signal) 
Dpg. The reproduced texture signal Dpt or the repro- 
duced transparency signal Dpg from the adder 183 is 
stored in the frame memory bank 102f. 

40 The image decoding apparatus 100f further 
includes a synthesizer 190f which synthesizes the out- 
put of the decoding unit 180f (both of the reproduced 
texture signal Dpt and the reproduced transparency sig- 
nal Dpg, or only the reproduced texture signal Dpt) and 

45 the output of the decoding unit 170 (the reproduced 
shape signal Dk corresponding to the output of the 
decoding unit 180f) with desired image data, and out- 
puts synthesized image data Dsyn including the repro- 
duced arbitrary shape signal Dx with transparency 

so information or the reproduced arbitrary shape signal Dp 
toward the display unit 104. Although the synthesizer 
190f according to the ninth embodiment outputs the 
reproduced binary signal D2 from the decoding unit 170 
or the reproduced rectangle signal Dt from the decoding 

55 unit 180f directly to the display unit 104, the signal D2 or 
Dt may be synthesized with another arbitrary shape 
image signal in the synthesizer 1901 Although the sig- 
nal output from the synthesizer 1 90f is input to the dis- 
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play unit 1 04, it may be input to an information output 
apparatus such as a printer (not shown). 

In this ninth embodiment, with respect to pixels 
wherein the reproduced shape signal is zero, pixel val- 
ues in the reproduced texture signal and the reproduced 5 
transparency signal are replaced with pixel values of a 
prescribed image. The prescribed image is an image 
prepared in advance at the receiving end, or an image 
reproduced by another image decoding apparatus. 

A description is given of the operation of the image w 
decoding apparatus 100f according to the ninth embod- 
iment. 

Figure 1 6 is a flowchart of the decoding process by 
the image decoding apparatus 100f. When a coded 
image signal having a data structure shown in any of fig- is 
ures 13(a), 13(b), 14(a), and 14(b) is input to the image 
decoding apparatus 100f, the data analyzer 160f ana- 
lyzes the 2-bit image identifier that follows the 32-bit 
synchronous signal in the coded image signal, and 
decides whether the value of the image identifier is "SID 20 
= 00 or 11" or not (Step Sf1). 

When the result of the decision is SID = 00 or 11, it 
is decided whether SID = 00 or not (Step Sf2). When the 
result of the decision in Step Sf2 is SID = 00, since the 
input coded image signal Eo is the coded rectangle sig- 25 
nal Et, the data analyzer I60f controls the switch 101f 
with the switch control signal SWf so that the coded sig- 
nal is always input to the texture decoding unit 1801 
Thereby, in the texture decoding unit 180f, the coded 
texture bit streams in the coded rectangle signal are so 
subjected to decoding (Step Sf3). The decoding proc- 
ess in Step Sf3 is identical to the decoding process in 
Step Se2 according to the seventh embodiment. Then, 
the decoded rectangle signal is sent through the synthe- 
sizer 1 90f to the display unit 1 04 to be displayed (Step 35 
Sf13). In the synthesizer 1901, the decoded rectangle 
signal Dg may be synthesized with another arbitrary 
shape image signal. 

Thereafter, it is decided whether the coded rectan- 
gle signal Et is data of the last block of the last frame to 
(Step Sf14). When the coded rectangle signal is not 
data of the last block of the last frame, a coded rectan- 
gle signal corresponding to the next block is subjected 
to Steps Sf1~Sf3, Sf13, and Sf14. Conversely, when the 
coded rectangle signal is data of the last block of the 45 
last frame, decoding of the coded rectangle signal is 
ended. 

On the other hand, when the result of the decision 
in Step Sf2 is SID = 1 1 , the input coded image signal is 
the second coded arbitrary shape signal Ex including, so 
as image information, the coded shape bit streams, the 
coded texture bit streams, and the coded transparency 
bit streams. So, the data analyzer 160f controls the 
switch 101 f with the switch control signal SWf so that 
the coded shape bit stream corresponding to each block 55 
of the coded arbitrary shape signal Ex with transpar- 
ency signal is input to the shape decoding unit 1 70 while 
the coded texture bit stream and the coded transpar- 



ency bit stream corresponding to the block are input to 
the texture decoding unit 1 80f. Thereby, in the second 
coded arbitrary shape signal Ex, the coded shape bit 
stream is separated from the coded texture bit stream 
and the coded transparency bit stream (Step Sf4), and 
the separated shape bit stream is decoded by the shape 
decoding unit 170 (Step Sf5) while the separated tex- 
ture bit stream and the transparency bit stream are 
decoded by the texture decoding unit 180f (Steps St6 
and Sf7). 

To be specific, the coded shape bit stream Epk is 
decoded by the shape decoder 170, like the coded 
shape bit stream E2k of the binary image signal. When 
decoding of the coded shape bit stream Epk corre- 
sponding to one block has ended, the arithmetic 
decoder 171 detects the end of the coded shape bit 
stream Epk corresponding to this block and outputs an 
end detection signal Te toward the data analyzer 1 60f. 
Receiving the end detection signal Te, the data analyzer 
160f controls the switch 101b with the switch control sig- 
nal SWb so that the input coded image signal is input to 
the texture decoding unit 180f. 

Thereby, the coded texture bit stream Ept and the 
coded transparency bit stream Epg are sent from the 
data analyzer 160f to the texture decoding unit 180f. In 
the decoding unit 180f, the quantization scale and the 
DCT coefficients of each bit stream are input to the 
inverse quantizer 181f, and the DCT coefficients are 
subjected to inverse quantization. Thereafter, the 
inversely quantized signal Diq is subjected to inverse 
cosine transformation by the inverse cosine transformer 
182f. 

Meanwhile, the texture motion vector MV and the 
transparency motion vector MV are input to the texture 
motion compensator 184f. The compensator 184f gen- 
erates addresses for obtaining a prediction texture sig- 
nal and a prediction transparency signal, according to 
these motion vectors MV, and obtains a prediction tex- 
ture signal Emp and a prediction transparency signal 
Emg from the frame memory bank 102b using the 
addresses. 

The adder 1 83 performs addition of the output Didct 
from the inverse cosine transformer 182f and the predic- 
tion texture signal Emp or the prediction transparency 
signal Emg, and outputs a decoded texture signal Dpi or 
a decoded transparency signal Dpg. These decoded 
signals Dpt and Dpg are input to the frame memory 
bank 102f and the synthesizer 1901 In the synthesizer 
190f, the reproduced texture signal Dpt, the reproduced 
transparency signal Dpg, and the corresponding repro- 
duced shape signal Dk are synthesized with a desired 
image to generate synthesized image data Dsyn. When 
decoding of the coded transparency bit stream has 
ended, the data analyzer 160f detects the end of the 
coded transparency bit stream, and controls the switch 
101f by the switch control signal SWb so that the input 
coded image signal is input to the shape decoding unit 
170. 
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Further, ihe reproduced and synthesized data 
(decoded arbitrary shape signal with transparency infor- 
mation) Dx is output from the synthesizer 190f to the 
display unit 104 for image display (Step Sf13). 

Thereafter, it is decided whether the second coded 5 
arbitrary shape signal is data of the last block of the last 
frame (Step Sf14). When it is not data of the last block 
of the last frame, a second coded arbitrary shape signal 
corresponding to the next block is subjected to Steps 
Sf1, Sf2, Sf4~Sf7, Sf13, and Sf14. Conversely, when it 10 
is data of the last block of the last frame, decoding of the 
second coded arbitrary shape signal is ended. 

When it is decided in Step Sf 1 that the value of the 
image identifier is not "SID = 00 or 11", it is decided in 
Step Sf8 whether SID = 1 0 or not. When the result of the is 
decision in Step Sf8 is SID = 10, since the input coded 
image signal Eo is the coded binary signal E2 that 
includes only the coded shape bit streams as image 
information, the data analyzer 160f controls the switch 
101 f with the switch control signal SWf so that the 20 
coded shape bit stream corresponding to each block in 
the coded binary signal is always input to the shape 
decoding unit 170. Thereby, in the shape decoding unit 
170, the coded shape bit streams in the coded binary 
signal are subjected to decoding (Step Sf9). The decod- 25 
ing process in Step Sf9 is identical to the decoding proc- 
ess in Step Sb2 according to the third embodiment. 
Then, the decoded binary signal D2 is sent through the 
synthesizer 190f to the display unit 104 to be displayed 
(Step Sf 1 3). In the synthesizer 1 90f, the decoded binary 30 
signal D2 may be synthesized with another arbitrary 
shape image signal. 

Thereafter, it is decided whether the coded binary 
signal is data of the last block of the last frame (Step 
S114). When it is not data of the last block of the last ss 
frame, a coded binary signal corresponding to the next 
block is subjected to Steps Sf1, Sf8, Sf9, Sf13, and 
Sf 1 4. Conversely, when it is data of the last block ot the 
last frame, decoding of the coded binary signal is 
ended. 40 

On the other hand, when the result of the decision 
in Step Sf8 is SID = 01, the input coded image signal is 
the first coded arbitrary shape signal including, as 
image information, the coded shape bit streams and the 
coded texture bit streams. So, the data analyzer 160f 45 
controls the switch 101f with the switch control signal 
SWf so that the coded shape bit stream corresponding 
to each block of the coded arbitrary shape signal is input 
to the shape decoding unit 1 70 while the coded texture 
bit stream corresponding to the block is input to the tex- so 
ture decoding unit 180f. Thereby, in the first coded arbi- 
trary shape signal, the coded shape bit stream and the 
coded texture bit stream are separated from each other 
(Step Sf10), and the separated shape bit stream is 
decoded by the shape decoding unit 170 (Step SM1) 55 
while the separated texture bit stream is decoded by the 
texture decoding unit 180f (Step Sf12). Further, these 
streams are displayed by the display unit 104 (Step 



Sf13). The decoding processes in Steps Sf1 1 and Sf12 
are identical to the decoding processes in Steps Sb4 
and Sb5 according to the third embodiment, respec- 
tively, and the image display in Step Sf13 is identical to 
the image display in Step Sb6 according to the third 
embodiment. 

Thereafter, it is decided whether the first coded 
arbitrary shape signal Ep is data of the last block of the 
last frame (Step Sf14). When it is not data of the last 
block of the last frame, a first coded arbitrary shape sig- 
nal Ep corresponding to the next block is subjected to 
Steps Sfl, Sf8, and Sf10~Sf14. Conversely, when it is 
data of the last block of the last frame, decoding of the 
first coded arbitrary shape signal Ep is ended. 

In the decoding process according to this ninth 
embodiment, when decoding the first or second coded 
arbitrary shape image signal, with respect to pixels 
wherein the reproduced shape signal is zero, pixel val- 
ues in the reproduced texture signal and the reproduced 
transparency signal are replaced with pixel values of a 
prescribed image. The prescribed image is an image 
prepared in advance at the receiving end, or an image 
reproduced by another image decoding apparatus. 

As described above, according to the ninth embod- 
iment of the invention, the image decoding apparatus 
100f includes the data analyzer 160f for analyzing the 
input coded data. The data analyzer 160f detects an 
image identifier indicating which coded signal is the 
input coded image signal among the coded binary sig- 
nal E2, the first coded arbitrary shape signal Ep, the 
second coded arbitrary shape signal Ex, and the coded 
rectangle signal Eg and, according to the value of the 
image identifier, the analyzer 160f controls the switch 
101f so that the bit streams in the input coded signal are 
appropriately input to one of the shape decoding unit 
170 and the texture decoding unit 180f. Therefore, the 
above-mentioned four coded image signals having dif- 
ferent data structures can be decoded by decoding 
processes corresponding to a single coding method. 

In this ninth embodiment of the invention, the coded 
shape bit stream is decoded by the arithmetic decoding 
method (first decoding method) in the shape decoding 
unit 170, and the coded texture bit stream and the 
coded transparency bit stream are decoded by the 
same decoding method including DCT. In other words, 
the decoding method for the coded texture bit stream 
(second decoding method) is identical to the decoding 
method for the coded transparency bit stream (third 
decoding method). However, the coded transparency bit 
stream may be decoded by a decoding method includ- 
ing wavelet processing or the like (third decoding 
method) which is different from the decoding method 
including DCT (second decoding method). 

In this ninth embodiment of the invention, emphasis 
has been placed on the image decoding apparatus 100f 
which identifies the coded signals having the data struc- 
ture described for the eighth embodiment, i.e., first and 
second coded arbitrary shape signals, a coded binary 
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signal, and a coded rectangle signal, using their image 
identifiers, and performs decoding processes according 
to the respective data structures. However, when the 
image input unit 110 in the image coding apparatus 
1 00a according to the second embodiment shown in fig- 
ure 3 is constructed so that it can identify an arbitrary 
shape image signal, a binary image signal, a rectangle 
image signal, and an arbitrary shape image signal with 
transparency information, it is possible to implement an 
image coding apparatus that performs coding of these 
image signals with image identifiers for identifying the 
coded signals corresponding to these image signals. 

[Embodiment 10] 

Figures 1 7(a)-1 7(c) and figures 18(a)-18(b) aredia- 
grams for explaining data structures for image transmis- 
sion according to a tenth embodiment of the present 
invention. More specifically, figure 17(a) shows a data 
structure of a coded arbitrary shape signal (first coded 
arbitrary shape signal) 3500 obtained by coding an arbi- 
trary shape image signal including both of coded shape 
bit streams and coded texture bit streams, figure 1 7(b) 
shows a data structure of a coded binary signal 3600 
obtained by coding a binary image signal, and figure 
17(c) shows a data structure of a coded rectangle signal 
(coded pixel value signal) 3700 obtained by coding a 
rectangle image signal which includes, as display data, 
only a texture signal (pixel value signal) for color display 
of an image. Further, figure 18(a) shows a data struc- 
ture of a coded arbitrary shape signal with transparency 
information (second coded arbitrary shape signal) 3800 
obtained by coding an arbitrary shape image signal 
including transparency information, and figure 18(b) 
shows a data structure of a coded transparency signal 
3900 obtained by coding an arbitrary shape transpar- 
ency signal including both of coded shape bit streams 
and coded transparency bit streams. 

These data structures for image transmission 
according to the tenth embodiment are applied to a sys- 
tem in which the arbitrary shape image signal including 
transparency information is compressively coded and 
transmitted and, further, the second coded arbitrary 
shape signal obtained by the compressive coding is 
decoded and displayed. That is, in this system, the 
image coding method and apparatus are constructed to 
perform a coding process for the arbitrary shape image 
signal including transparency information while the 
image decoding method and apparatus are constructed 
to perform a decoding process for the second coded 
arbitrary shape signal. 

The first coded arbitrary shape signal 3500 shown 
in figure 17(a) is identical to the coded arbitrary shape 
signal 2500 according to the eighth embodiment shown 
in figure 13(a), that is, it is obtained by replacing the 1- 
bit shape identifier 502 in the coded arbitrary shape sig- 
nal 500 according to the first embodiment with a two-bit 
image identifier 3502. 



The coded binary shape signal 3600 shown in fig- 
ure 1 7(b) is identical to the coded binary shape signal 
2600 according to the eighth embodiment shown in fig- 
ure 13(b), that is, it is obtained by replacing the 1-bit 

5 shape identifier 602 in the coded binary shape signal 
600 according to the first embodiment with a two-bit 
image identifier 3602. 

The coded rectangle signal (coded pixel value sig- 
nal) 3700 shown in figure 1 7(c) is identical to the coded 

10 rectangle signal 2700 according to the eighth embodi- 
ment shown in figure 14(a), that is, it is obtained by 
inserting a two-bit image identifier 3702 between the 
synchronous signal 701 and the header 703 in the 
coded pixel value signal 700a shown in figure 23(c). 

75 The second coded arbitrary shape signal 3800 
shown in figure 18(a) is identical to the second coded 
arbitrary shape signal 2800 according to the eighth 
embodiment shown in figure 14(b), and has a two-bit 
image identifier 3802. 

20 Further, the coded transparency signal shown in 
figure 18(b) includes a 32 -bit synchronous signal 901 at 
the beginning, a 3-bit image identifier (SID) 3902 that 
follows the synchronous signal 901, and an additional 
header 903 that follows the identifier 3902. 

25 Further, the coded transparency signal 3900 
includes coded shape bit streams 91 E obtained by cod- 
ing a shape signal (binary transparency signal) that rep- 
resents the shape of each object as a constituent of the 
arbitrary shape transparency signal; and coded trans- 

so parency bit streams 92E obtained by coding a multival- 
ued transparency signal used for gradation display of 
the transparency of each object as a constituent of the 
arbitrary shape image signal. To be specific, in the 
coded transparency signal 3900, for each of plural 

35 blocks into which an object region including an object on 
a display image is divided, a coded shape bit stream 
91 E and a coded transparency bit stream 92E are 
arranged in this order. 

More specifically, following the header 903, 

40 arranged are a coded shape bit stream 91 E1 and a 
coded transparency bit stream 92E1 corresponding to a 
block E1 ; a coded shape bit stream 91 E2 and a coded 
transparency bit stream 92E2 corresponding to a block 
E2; and a coded shape bit stream 91 E3 and a coded 

45 transparency bit stream 92E3 corresponding to a block 
E3. 

Further, as shown in figure 18(b), the coded shape 
bit streams 91 E1, 91 E2, and 91 E3 comprise variable- 
length coded data 904, 909, and 914 corresponding to 

so shape motion vectors (in the figure, referred to simply as 
shape MV); and variable-length coded data 905, 910, 
and 915 corresponding to binary shape signals (binary 
transparency signals), each signal indicating whether 
pixels within the block are located inside the object or 

55 outside the object (in the figure, shape data), respec- 
tively. 

The coded transparency bit streams 92E1, 92 E2. 
and 92E3 comprise 5-bit coded data 906, 91 1 , and 916 
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corresponding to quantization scales (in the figure, 
quantization scale); variable-length coded data 907, 
912, and 917 corresponding to transparency motion 
vectors (in the figure, transparency MV); and variable- 
length coded data 908, 913, and 918 corresponding to 5 
quantized signals obtained by subjecting the multival- 
ued transparency signal to DCT and quantization (in the 
figure, transparency DCT coefficients), respectively. 

The synchronous signal 901 is a signal indicating 
the head of a coded transparency signal corresponding 10 
to one object, and this is a unique coded signal. The 
image identifier (SID) 3902 is a signal indicating which 
type is the coded image signal among the above-men- 
tioned five types of coded image signals, i.e., it is a sig- 
nal for identifying the kind of the bit streams included in 15 
the coded image signal. When the value of the image 
identifier (SID) is 10, the identifier indicates that both of 
the coded shape bit streams and the coded texture bit 
streams exist, as display data, in the coded image sig- 
nal. When SID = 010, the image identifier indicates that 20 
only the coded shape bit streams exist in the coded sig- 
nal. When SID = 00, the image identifier indicates that 
only the coded texture bit streams exist in the coded sig- 
nal. When SID is 11, the image identifier indicates that 
the coded shape bit streams, the coded texture bit 25 
streams, and the coded transparency bit streams exist 
in the coded signal. Further, when SID = 01 1 , the image 
identifier indicates that the coded shape bit streams and 
the coded transparency bit streams exist in the coded 
signal. 30 

The header 903 includes information relating to, for 
example, the time to display the image of the corre- 
sponding object, the attribute of the image, and the pre- 
diction mode for coding. However, these data have no 
relationship with the present invention and, therefore, 3S 
do not require detailed description. 

The shape MV is the coded data of a motion vector 
which shows a motion of an image within a block 
between the present frame and the previous frame, and 
this is used for prediction of a shape signal of a specific 40 
block in the present frame from a shape signal of the 
corresponding block in the previous frame. Further, the 
shape data is obtained by subjecting a shape signal to 
arithmetic coding, and the quantization scale is a 
parameter for inversely quantizing DCT coefficients 45 
which are obtained by subjecting a multivalued trans- 
parency signal to DCT and quantization. The transpar- 
ency MV is a coded signal of a transparency motion 
vector which shows a motion of an image within a block 
between the present frame and the previous frame, and so 
this is used for prediction of a multivalued transparency 
signal of the present frame from a multivalued transpar- 
ency signal of the previous frame. Further, the transpar- 
ency DCT coefficients are obtained by subjecting a 
quantized signal of the multivalued transparency signal 55 
to variable-length coding. 

In the coded transparency signal 3900, actually, 
various kinds of side information (data) are arranged 



before the transparency DCT coefficients although 
these data are not shown in the figure, and these side 
data sometimes require a large number of bits. 

A description is now given of the function and effect. 

In the tenth embodiment so constructed, since the 
five coded image signals having different data struc- 
tures, i.e., the first coded arbitrary shape signal 3500, 
the coded binary signal 3600, the coded rectangle sig- 
nal 3700, the second coded arbitrary shape signal 
3800, and the coded transparency signal 3900 include 
the image identifiers 3502, 3602, 3702, 3802, and 3902 
for identifying these coded signals, respectively. There- 
fore, these five coded signals can be identified at the 
decoding end by referring to the image identifiers. 

To be specific, in an image decoding apparatus 
adapted to MPEG4 and constructed to decode the 
coded arbitrary shape signal mentioned above, even 
when the coded binary signal and the coded rectangle 
signal are input besides the first and second coded arbi- 
trary shape signals and the arbitrary shape transpar- 
ency signal, the image identifiers enable the apparatus 
to appropriately decode the coded binary signal and the 
coded rectangle signal. Therefore, it is possible to avoid 
that the coded binary signal and the coded rectangle 
signal are subjected to the decoding process adapted to 
MPEG4 and the decoding operation comes to a rupture. 

While in this tenth embodiment the respective 
coded signals having different data structures are 
obtained by inter-frame prediction coding, these coded 
signals may be obtained by intra-frame coding. 

Furthermore, in the respective coded signals 
employed in this tenth embodiment, at least one of the 
coded shape bit stream, the coded texture bit stream, 
and the coded transparency bit stream is arranged for 
each block. However, in each coded signal, prescribed 
coded bit streams may be arranged for each frame. 

Furthermore, in the data structure of the second 
coded arbitrary shape signal 3800 (coded signal of the 
arbitrary shape image signal with transparency informa- 
tion) shown in figure 18(a), each of the coded transpar- 
ency bit streams 83D includes a transparency MV and a 
quantization scale, as in the eighth embodiment of the 
invention. However, it is not always necessary for the 
coded transparency bit stream 83D in the second coded 
arbitrary shape signal 3800 to have the transparency 
MV and the quantization scale, and decoding of the 
transparency DCT coefficients may be carried out using 
the texture MV and the quantization scale in the coded 
texture bit stream 82D. 

[Embodiment 11] 

Figure 19 is a block diagram illustrating an image 
decoding apparatus according to an eleventh embodi- 
ment of the present invention. In figure 11, reference 
numeral lOOg designates an image decoding apparatus 
which performs decoding of coded data having the data 
structure for image transmission according to the tenth 
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embodiment. The image decoding apparatus 100g is 
constructed to perform a decoding process according to 
the data structure of a coded signal input as coded data. 

More specifically, the apparatus 100g is con- 
structed as follows. When the apparatus 100g receives 5 
the coded binary signal (E2) 3600 shown in figure 17(b) 
as coded data Eo, it outputs a decoded binary signal D2 
as decoded image data Dsyn. When the apparatus 
1 0Og receives thef irst coded arbitrary shape signal (Ep) 
3500 shown in figure 1 7(a) as coded data Eo, it outputs 10 
a decoded arbitrary shape signal Dp as decoded image 
data Dsyn, for each object. When the apparatus lOOg 
receives the coded rectangle signal (Et) 3700 shown in 
figure 17(c) as coded data Eo, it outputs a decoded rec- 
tangle signal Dt as decoded image data Dsyn. When 15 
the apparatus 100g receives the second coded arbitrary 
shape signal (Ex) 3800 shown in figure 18(a) as coded 
data Eo, it outputs a decoded arbitrary shape signal Dx 
with transparency information as decoded image data 
Dsyn, for each object. When the apparatus 100g so 
receives the coded transparency signal (Ep) 3900 
shown in figure 18(b), it outputs a decoded transpar- 
ency signal Dg as decoded image data Dsyn for each 
object. 

Hereinafter, the structure of the image decoding 25 
apparatus 100g will be described in detail. 

The apparatus 100g comprises a data analyzer 
1 60g which analyzes the coded data Eo input to the 
input terminal 160a and outputs a switch control signal 
SWg according to the analysis result; a shape decoding 30 
unit (first decoding means) 170 which subjects the 
shape data (coded shape signal) to an arithmetic 
decoding process; a texture decoding unit (second 
decoding means) 1 80f which subjects the texture DCT 
coefficients Ept and the transparency DCT coefficients 35 
Egt to a decoding process including inverse DCT; a 
switch lOlg which supplies the coded signal analyzed 
by and output from the data analyzer 160g to one of the 
decoding units 170 and 180f, according to the switch 
control signal; and a frame memory bank 102f which 40 
stores a prediction shape signal, a prediction texture 
signal, and a prediction transparency signal. 

The data analyzer 160g analyzes the input coded 
signal to examine the 2-bit image identifier that follows 
the 32-bit synchronous signal at the head (i.e., any of 45 
the image identifiers 3502, 3702, and 3802 in the coded 
signals shown in figures 17(a), 17(c), and 18(a), respec- 
tively), and controls the switch 1 01g according to the set 
value of the image identifier (SID). 

When the set value of the image identifier (SID) is so 
10, the switch 101g is controlled by the switch control 
signal SWg so that the coded shape bit streams of the 
input coded signal are input to the shape decoding unit 
1 70 and the coded texture bit streams of the coded sig- 
nal are input to the texture decoding unit 180f. When 55 
SID = 010, the switch 101 g is controlled by the switch 
control signal SWg so that the input coded data is input 
to the shape decoding unit 170. When SID = 00, the 



switch 101g is controlled by the switch control signal 
SWg so that the coded texture bit streams of the input 
coded signal are input to the texture decoding means 
180f. When SID = 11, the switch 101g is controlled by 
the switch control signal SWg so that the coded shape 
bit streams of the input coded signal are input to the 
shape decoding unit 170, and the coded texture bit 
streams and the coded transparency bit streams are 
input to the texture decoding unit 1 801 When SID = 01 1 , 
the switch lOlg is controlled by the switch control signal 
SWg so that Ihe coded shape bit streams in the input 
coded signal are input to the shape decoding unit 1 70 
while the coded transparency bit streams are input to 
the texture decoding unit 180f. 

Like the data analyzer 160 according to the third 
embodiment shown in figure 7(b), the data analyzer 
160g comprises a table storage unit 162, a comparator 
161, and a switch control circuit 163. In this eleventh 
embodiment, the comparator 161 compares the bit 
stream of the input coded signal with the bit stream of 
the decoding reference table stored in the table storage 
unit 162, and identifies the image identifier, coded 
shape bit stream, coded texture bit stream, and coded 
transparency bit stream, which are included in the 
coded data. According to the result of the identification, 
the comparator 161 controls the switch 101g with the 
switch control signal SWg. 

The image decoding apparatus 100g further 
includes a synthesizer 190g which synthesizes the out- 
put of the decoding unit 1 80f (both of the reproduced 
texture signal Dpt and the reproduced transparency sig- 
nal Dpg, or only the reproduced texture signal Dpt or the 
reproduced transparency signal Dgt) and the output of 
the decoding unit 170 (the reproduced shape signal Dk 
corresponding to the output of the decoding unit 180f) 
with desired image data, and outputs synthesized 
image data Dsyn including one of the reproduced arbi- 
trary shape signal Dx with transparency information, the 
reproduced arbitrary shape signal Dp, and the repro- 
duced transparency signal Dg, toward the display unit 
104. Although the synthesizer 190g according to the 
eleventh embodiment outputs the reproduced binary 
signal D2 from the decoding unit 170 or the reproduced 
rectangle signal Dt from the decoding unit 1 80f directly 
to the display unit 104, the signal D2 or Dt may be syn- 
thesized with another arbitrary shape image signal in 
the synthesizer 190g. Although the signal output from 
the synthesizer 190g is input to the display unit 104, it 
may be input to an information output apparatus such 
as a printer (not shown). Other constituents are identical 
to those already described for the ninth embodiment. 

In this eleventh embodiment, with respect to pixels 
wherein the reproduced shape signal is zero, pixels val- 
ues in the reproduced texture signal are replaced with 
pixel values of a prescribed image. The prescribed 
image is an image prepared in advance at the receiving 
end, or an image reproduced by another image decod- 
ing apparatus. 
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A description is given of the operation of the image 
decoding apparatus 100g according to the eleventh 
embodiment. 

Figure 20 is a flowchart of the decoding process by 
the image decoding apparatus 100g. When a coded 5 
image signal having the data structure shown in any of 
figures 17(a), 17(b), 17(c), 18(a), and 18(b) is input to 
the image decoding apparatus 100g, the data analyzer 
160g analyzes the 2-bit code that follows the 32-bit syn- 
chronous signal in the coded image signal, and decides 
whether this 2-bit code is 01 or not (Step Sg1). When 
the 2-bit code is not 01 , it is further decided whether the 
2-bit code is, as the value of the image identifier, "SID = 
00 or 1 1" or not (Step Sg2). 

When the result of the decision in Step Sg2 is SID 
= 00 or 1 1 , it is decided whether SID = 00 or not (Step 
Sg3). When the result of the decision in Step Sg3 is SID 
= 00, since the input coded image signal Eo is the 
coded rectangle signal (coded pixel value signal) Et, the 
data analyzer 160g controls the switch 101g with the 
switch control signal SWg so that the coded signal is 
always input to the texture decoding unit 180f. Thereby, 
in the texture decoding unit 180f, the coded texture bit 
streams (coded pixel value bit streams) in the coded 
rectangle signal are subjected to decoding (Step Sg4). 
The decoding process in Step Sg4 is identical to the 
decoding process in Step Se2 according to the seventh 
embodiment. Thereafter, the decoded rectangle signal 
Dt is sent through the synthesizer 190g to the display 
unit 104 to be displayed as an image (Step Sg18). In the 
synthesizer 190g, the decoded rectangle signal Dt may 
be synthesized with another arbitrary shape image sig- 
nal. 

Next, it is decided whether the coded rectangle sig- 
nal Et is data of the last block of the last frame (Step 
Sg19) When the coded rectangle signal is not data of 
the last block of the last frame, a coded rectangle signal 
Et corresponding to the next block is subjected to Steps 
Sgl~Sg4, Sg18, and Sg19. Conversely, when the 
coded rectangle signal is data of the last block of the 
last frame, decoding of the coded rectangle signal is 
ended. 

On the other hand, when the result of the decision 
in Step Sg3 is SID = 1 1 , the input coded image signal is 
the second coded arbitrary shape signal including, as 
image information, the coded shape bit streams, the 
coded texture bit streams, and the coded transparency 
bit streams. So, the data analyzer 160g controls the 
switch 101 g with the switch control signal SWg so that 
the coded shape bit stream corresponding to each block 
of the second coded arbitrary shape signal is input to 
the shape decoding unit 170 while the coded texture bit 
stream and the coded transparency bit stream corre- 
sponding to the block are input to the texture decoding 
unit 180f. Thereby, in the second coded arbitrary shape 
signal, the coded shape bit stream is separated from the 
coded texture bit stream and the coded transparency bit 
stream (Step Sg5), and the separated shape bit stream 



is decoded by the shape decoding unit 1 70 (Step Sg6) 
while the separated texture bit stream and transparency 
bit stream are decoded by the texture decoding unit 
180f (Steps Sg7 and Sg8). The processes in Steps 
Sg5~Sg8 are identical to those in Steps Sf4~Sf7 
according to the ninth embodiment. The decoded shape 
signal Dxk, the decoded texture signal Dxt, and the 
decoded transparency signal Dgt are synthesized by 
the synthesizer 190g, and the second decoded arbitrary 
shape signal Dx, as the synthesized signal Dsyn, is 
input to the display unit 1 04 and displayed as an image 
(Step Sg18). 

Thereafter, it is decided whether the second coded 
arbitrary shape signal Ex is data of the last block of the 
last frame (Step Sg19). When it is not data of the last 
block of the last frame, a second coded arbitrary shape 
signal Ex corresponding to the next block is subjected to 
Steps Sg1~Sg3, Sg5~Sg8, Sg18, and Sg19. Con- 
versely, when it is data of the last block of the last frame, 
decoding of the second coded arbitrary shape signal is 
ended. 

When it is decided in Step Sg2 that SID = 10, the 
input coded image signal Eo is the first coded arbitrary 
shape signal Ep including, as image information, the 
coded shape bit streams and the coded texture bit 
streams. So, the data analyzer 160g controls the switch 
10lg with the switch control signal SWg so that the 
coded shape bit stream corresponding to each block of 
the coded arbitrary shape signal is input to the shape 
decoding unit 170 while the coded texture bit stream 
corresponding to the block is input to the texture decod- 
ing unit 180f. Thereby, in the coded arbitrary shape sig- 
nal, the coded shape bit stream and the coded texture 
bit stream are separated from each other (Step Sg9), 
and the separated shape bit stream is decoded by the 
shape decoding unit 170 (Step SglO) while the sepa- 
rated texture bit stream is decoded by the texture 
decoding unit 180f (Steps Sg11). The decoding proc- 
esses in Steps Sg9~Sg11 are identical to those in 
Steps Sf10~Sf12 according to the ninth embodiment. 

The decoded shape signal Dpk and the decoded 
texture signal Dpi are synthesized by the synthesizer 
190g, and the first decoded arbitrary shape signal Dp, 
as the synthesized signal Dsyn, is input to the display 
unit 104 and displayed as an image (Step Sg18). 

Thereafter, it is decided whether the first coded 
arbitrary shape signal Ep is data of the last block of the 
last frame (Step Sg19). When it is not data of the last 
block of the last frame, a first coded arbitrary shape sig- 
nal Ep corresponding to the next block is subjected to 
Steps Sg1, Sg2, Sg9~Sg1l, Sg18, and Sg19. Con- 
versely, when it is data of the last block of the last frame, 
decoding of the first coded arbitrary shape signal is 
ended. 

Furthermore, when the result of the decision in 
Step Sgl is that the 2-bit code following the synchro- 
nous signal is 01 , it is decided in Step Sg12 whether the 
value of the image identifier (SID) is 010 or not. When 
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SID = 010, since the input coded image signal Eo is the 
coded binary signal E2 that includes only the coded 
shape bit streams as image information, the data ana- 
lyzer 160g controls the switch 101g with the switch con- 
trol signal SWg so that the coded shape bit stream 
corresponding to each block in the coded binary signal 
is always input to Ihe shape decoding unit 170. Thereby, 
in the shape decoding unit 170, the coded shape bit 
streams in the coded binary signal are subjected to 
decoding (Step Sg13). The decoding process in Step 
Sg13 is identical to the decoding process in Step Sb2 
according to the third embodiment. Thereafter, the 
decoded binary signal D2 is sent through the synthe- 
sizer 190g to the display unit 104 to be displayed (Step 
Sg18). In the synthesizer 190g, the decoded binary sig- 
nal D2 may be synthesized with another arbitrary shape 
image signal. 

Thereafter, it is decided whether the coded binary 
signal is data of the last block of the last frame (Step 
Sgl9). When it is not data of the last block of the last 
frame, a coded binary signal corresponding to the next 
block is subjected to Steps Sg1, Sg12, Sg13, Sg18, and 
Sg19. Conversely, when it is data of the last block of the 
last frame, decoding of the coded binary signal is 
ended. 

On the other hand, when the result of the decision 
in Step Sg12 is that SID is not 010, it is decided in Step 
SgH whether SID is 01 1 or not. When SID is not 011, 
the decoding process by the image decoding apparatus 
100g returns toStepSgl. 

When the result of the decision in Step Sg14 is SID 
= 011, since the input coded image signal Eo includes, 
as image information, the coded shape bit streams Egk 
and the coded transparency bit streams Egt, the data 
analyzer 160g controls the switch 101g with the switch 
control signal SWg so that the coded shape bit stream 
Egk corresponding to each block of the coded arbitrary 
shape signal is input to the shape decoding unit 170 
while the coded transparency bit stream Egt corre- 
sponding to the block is input to the texture decoding 
unit 180f. Thereby, in the coded transparency signal, the 
coded shape bit stream Egk and the coded texture bit 
stream Egt are separated from each other (Step Sg15), 
and the separated shape bit stream Egk is decoded by 
the shape decoding unit 170 (Step Sg16) while the sep- 
arated transparency bit stream Egt is decoded by the 
texture decoding unit 180f (Step Sg17). 

That is, the coded shape bit stream Egk is decoded 
by the shape decoding unit 1 70, like the coded shape bit 
stream E2k of the binary image signal. When decoding 
of the coded shape bit stream Egk corresponding to one 
block has ended, the arithmetic decoder 1 71 detects the 
end of the coded shape bit stream Egk corresponding to 
this block and outputs an end detection signal Te toward 
the data analyzer 160g. Receiving the end detection 
signal Te, the data analyzer 160g controls the switch 
101 g with the switch control signal SWb so that the 
input coded image signal is input to the texture decod- 



ing unit 1801 

Thereby, the coded transparency bit stream Egt is 
sent from the data analyzer 160g to the texture decod- 
ing unit 1801. In the decoding unit 180f, the quantization 

5 scale and the DCT coefficients in each bit stream are 
input to the inverse quantizer 181f, and the DCT coeffi- 
cients are subjected to inverse quantization. Thereafter, 
the inversely quantized signal Diq is subjected to 
inverse cosine transformation by the inverse cosine 

10 transformer 182f. 

Meanwhile, the transparency motion vector MV is 
input to the texture motion compensator 1 841 The com- 
pensator 184f generates an address for obtaining a pre- 
diction transparency signal according to the 

is transparency motion vector MV, and obtains the predic- 
tion texture signal Emg from the frame memory bank 
102f using the address. 

The adder 1 83 performs addition of the output Didct 
from the inverse cosine transformer I82f and the predic- 

20 tion transparency signal Emg, and outputs the decoded 
transparency signal Dgt. The decoded transparency 
signal Dgt is input to the frame memory bank 102f and 
the synthesizer 190g. The synthesizer 190g synthe- 
sizes the reproduced transparency signal Dgt and the 

25 corresponding reproduced shape signal Dgk with a 
desired image to generate synthesized image data 
Dsyn. The synthesized image data Dsyn is input to the 
display unit 104 to be displayed as an image (Step 
Sgl8). When decoding of the coded transparency bit 

30 stream has ended, the data analyzer I60g detects the 
end of the coded transparency bit stream, and controls 
the switch 1 0lf by the switch control signal SWg so that 
the input coded image signal is input to the shape 
decoding unit 170. 

35 Thereafter, it is decided whether the input coded 
signal is data of the last block of the last frame (Step 
Sg19). When it is not data of the last block of the last 
frame, an input coded data corresponding to the next 
block is subjected to Steps Sgl, Sgl2, and 

40 Sgl4~Sgl 9. Conversely, when it is data of the last block 
of the last frame, decoding of the input coded signal is 
ended. 

In this eleventh embodiment, with respect to pixels 
wherein the reproduced shape signal is zero, pixel val- 

45 ues in the reproduced first and second arbitrary shape 
signal and the reproduced transparency signal are 
replaced with pixel values of a prescribed image. The 
prescribed image is an image prepared in advance at 
the receiving end, or an image reproduced by another 

so image decoding apparatus. 

As described above, according to the eleventh 
embodiment of the invention, the image decoding appa- 
ratus 100g includes the data analyzer 160g which ana- 
lyzes the input coded data. The data analyzer 160g 

55 detects an image identifier showing which coded signal 
is the input coded image signal among the coded binary 
signal E2, the first and second coded arbitrary shape 
signals Ep and Ex, the coded rectangle signal Et, and 



34 



67 



EP 0 891 093 A2 



68 



the coded transparency signal Eg and, according to the 
value of the image identifier, the analyzer 160g controls 
the switch 101g so that the input coded signal is appro- 
priately input to one of the shape decoding unit 1 70 and 
the texture decoding unit 180f. Therefore, the above- 
mentioned five coded image signals having different 
data structures can be decoded by decoding processes 
corresponding to a single coding method. 

Furthermore, in this eleventh embodiment, the 
coded texture bit stream and the coded transparency bit 
stream are decoded by the same decoding method in 
the texture decoding unit I80f. However, the coded 
transparency bit stream may be decoded by a decoding 
method different from that for the coded texture bit 
stream, as described for the ninth embodiment. 

Moreover, in this eleventh embodiment, emphasis 
has been placed on the image decoding apparatus 
100g which identifies the coded signals having the data 
structure described for the tenth embodiment, i.e., first 
and second coded arbitrary shape signals, a coded 
binary signal, a coded transparency signal, and a coded 
rectangle signal, using their image identifiers, and per- 
forms decoding processes according to the respective 
data structures. However, when the image input unit 
1 10 in the image coding apparatus 100a according to 
the second embodiment shown in figure 3 is con- 
structed so that it can identify an arbitrary shape image 
signal, a binary image signal, a rectangle image signal, 
an arbitrary shape transparency signal, and an arbitrary 
shape image signal with transparency information, it is 
possible to implement an image coding apparatus that 
performs coding of these image signals with image 
identifiers for identifying the coded signals correspond- 
ing to these image signals. 

When a coding or decoding program for implement- 
ing any of the aforementioned image coding apparatus, 
image decoding apparatuses, image coding method, 
and image decoding methods, is recorded on a data 
storage medium (e.g., a floppy disk), the image 
processing according to any of the aforementioned 
embodiments can be easily executed in an independent 
computer system. 

Figures 21(a)-21(c) are diagrams for explaining 
when the image coding process according to the sec- 
ond embodiment or the image decoding process 
according to any of the third, fourth, fifth, seventh, ninth, 
and eleventh embodiments is executed by a computer 
system using a floppy disk which contains the coding or 
decoding program. 

Figure 21 (a) shows a front view of a floppy disk FD, 
a cross-sectional view thereof, and a floppy disk body D 
Figure 21(b) shows an example of a physical format of 
the floppy disk body D. 

The floppy disk body D is contained in a case FC, 
thereby providing the floppy disk FD. On the surface of 
the disk body D, a plurality of tracks Tr are formed con- 
centrically from the outer circumference of the disk 
toward the inner circumference. Each track is divided 



into 16 sectors in the angular direction. Therefore, in the 
floppy disk FD containing the above-mentioned pro- 
gram, data of the program are recorded in assigned 
sectors on the floppy disk body D. 
5 Figure 21(c) shows the structure for recording the 
program in the floppy disk FD and for performing image 
processing using the program recorded in the floppy 
disk FD. 

To be specific, when the program is recorded in the 

10 floppy disk FD, data of the program are written in the 
floppy disk FD from the computer system Cs through 
the floppy disk drive FDD. When the above-mentioned 
image coding apparatus or image decoding apparatus 
is constructed in the computer system Cs from the pro- 

15 gram recorded in the floppy disk FD, the program is read 
from the floppy disk FD by the floppy disk drive FDD and 
then loaded to the computer system Cs. 

Although in the above description a floppy disk is 
employed as a data storage medium, an optical disk 

20 may be employed. Also in this case, image coding or 
decoding can be carried out by software, in similar man- 
ner to the above-mentioned case of using the floppy 
disk. The storage medium is not restricted to the floppy 
disk and the optical disk, and any medium may be 

25 employed as long as it can contain the program, for 
example, an IC card or a ROM cassette. 

Furthermore, there is a case where coded image 
signals having different data structures, such as a 
coded binary signal, a coded rectangle signal, first and 

30 second coded arbitrary shape signals, and a coded 
transparency signal, are stored in a data storage 
medium such as an optical disk. 

Assuming that the coded image signals having dif- 
ferent data structures and stored in a data storage 

35 medium are coded image signals having the data struc- 
tures with image identifiers according to any of the first, 
sixth, eighth, and tenth embodiments, when these 
coded image signals are read from the medium and 
decoded, the respective coded signals can be identified 

40 with the identifiers and decoded adaptively to the data 
structures of the coded signals by an image decoding 
method or an image decoding apparatus corresponding 
to a single coding method. For example, coded image 
signals having different structures, such as a coded 

45 binary signal and a coded arbitrary shape signal, can be 
decoded. In this case, with respect to the coded binary 
signal, addition of a dummy coded texture signal when 
the binary signal is created is dispensed with, whereby 
unwanted increase in the coded bit number is almost 

so avoided. 

Claims 

1. A data structure for transmitting a coded image sig- 
55 nal obtained by coding a digital image signal, 
including: 

at least coded shape bit streams between the 
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following two kinds of bit streams: coded shape 
bit streams obtained by coding a shape signal 
which represents a display image of binary 
information or the shape in binary format of 
each object as one of the components of a dis- 5 
play image; and coded pixel value bit streams 
obtained by coding a pixel value signal repre- 
senting the gradation of the object; and 
an image identifier for deciding whether the 
coded image signal includes, as the coded bit w 
streams, both of the coded shape bit streams 
and the coded pixel value bit streams, or only 
the coded shape bit streams; 
wherein the image identifier and the coded bit 
streams are arranged so that the image identi- 15 
f ier is followed by the coded bit streams. 

2. The data structure for image transmission accord- 
ing to Claim 1 wherein the image identifier com- 
prises a 2-bit code. 20 

3. An image coding method receiving a digital image 
signal, and subjecting the digital image signal to a 
coding process according to the data structure of 

the digital image signal, wherein: 25 

it is decided whether the digital image signal is 
a binary image signal including, as data for dis- 
play, only a shape signal representing a display 
image of binary information, or an arbitrary 30 
shape image signal including, as data for dis- 
play, both of a shape signal representing the 
shape of each object as one of the components 
of a display image and a pixel value signal rep- 
resenting the gradation of the object; 35 
for the binary image signal, the shape signal is 
subjected to a first coding process and, in the 
coding process, an image identifier having a 
first value is generated, thereby creating a 
coded binary signal including the image identi- 40 
fier; 

for the arbitrary shape image signal, the shape 
signal is subjected to the first coding process 
while the pixel value signal is subjected to a 
second coding process employing a coding 45 
method different from that of the first coding 
process and, in the coding process, an image 
identifier having a second value is generated, 
thereby creating a coded arbitrary shape signal 
including the image identifier; and so 
in response to the input digital image signal, 
one of the coded binary signal and the coded 
arbitrary shape signal is output. 

4. The image coding method of Claim 3 wherein the 55 
image identifier comprises a 2-bit code. 

5. An image coding apparatus receiving a digital 



image signal, and subjecting the digital image sig- 
nal to a coding process according to the data struc- 
ture of the digital image signal, comprising: 

signal identification means for receiving the 
digital image signal, and deciding whether the 
digital image signal is a binary image signal 
including, as data for display, only a shape sig- 
nal representing a display image of binary 
information, or an arbitrary shape image signal 
including, as data for display, both of a shape 
signal representing the shape of each object as 
one of the components of a display image and 
a pixel value signal representing the gradation 
of the object, and outputting an identifier signal 
according the result of the decision; 
signal extraction means for extracting the 
shape signal from the binary image signal, and 
the shape signal and the pixel value signal from 
the arbitrary shape image signal; 
first coding means for coding the shape signals 
by a first coding process to generate coded 
shape bit streams; 

second coding means for coding the pixel value 
signal by a second coding process employing a 
coding method different from that of the first 
coding process, thereby generating coded pixel 
value bit streams; 

signal supply means for selecting, according to 
the identifier signal, one of first and second 
operations, where the first operation is to sup- 
ply the shape signal of the binary image signal 
to the first coding means, and the second oper- 
ation is to supply the shape signal of the arbi- 
trary shape image signal to the first coding 
means while supplying the pixel value signal of 
the arbitrary shape image signal to the second 
coding means; and 

multiplexing means for multiplexing the identi- 
fier signal from the signal identification means, 
the coded shape bit streams output from the 
first coding means, and the coded pixel value 
bit streams output from the second coding 
means; 

wherein a coded binary signal including the 
identifier signal and the coded shape bit 
streams is output when the binary image signal 
is input as the digital image signal, and a coded 
arbitrary shape signal including the identifier 
signal, the coded shape bit streams, and the 
coded pixel value bit streams is output when 
the arbitrary shape image signal is input as the 
digital image signal. 

6. An image decoding method receiving, as a coded 
signal obtained by coding a digital image signal, a 
coded image signal having an image identifier 
according to the data structure of the digital image 
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signal, and subjecting the coded image signal to a 
decoding process according to the data structure, 
wherein: 

the coded image signal is analyzed with refer- s 
ence to the image identifier to decide whether 
the coded image signal is a coded arbitrary 
shape signal including, as data for display, both 
of coded shape bit streams obtained by coding 
a shape signal which represents the shape of io 
each object as one of the components of a dis- 
play image and coded pixel value bit streams 
obtained by coding of a pixel value signal rep- 
resenting the gradation of the object, or a 
coded binary signal including, as data for dis- is 
play, only coded shape bit streams obtained by 
coding a shape signal representing a display 
image of binary information; 
when the input coded image signal is the coded 
arbitrary shape signal, decoding of the coded 20 
shape bit streams thereof by a first decoding 
method, and decoding of the coded pixel value 
bit streams thereof by a second decoding 
method, are carried out according to the 
arrangement of these bit streams; and 25 
when the input coded image signal is the coded 
binary signal, only decoding of the coded 
shape bit streams thereof by the first coding 
method is carried out. 

30 

The image decoding method of Claim 6 wherein the 
image identifier comprises a 2-bit code. 

An image decoding apparatus receiving, as a 
coded signal obtained by coding a digital image sig- 35 
nal, a coded image signal having an image identi- 
fier according to the data structure of the digital 
image signal, and subjecting the coded image sig- 
nal to a decoding process according to the data 
structure, comprising: to 

data analysis means for deciding whether the 
coded image signal is a coded arbitrary shape 
signal including, as data for display, both of 
coded shape bit streams obtained by coding a 45 
shape signal which represents the shape of 
each object as one of the components of a dis- 
play image and coded pixel value bit streams 
obtained by coding a pixel value signal repre- 
senting the gradation of the object, or a coded so 
binary signal including, as data for display, only 
coded shape bit streams obtained by coding a 
shape signal representing a display image of 
binary information; 

first decoding means for decoding the coded 55 
shape bit streams by a first decoding process 
to generate a decoded shape signal; 
second decoding means for decoding the 



coded pixel value bit streams by a second 
decoding process employing a decoding 
method different from that of the first decoding 
process, thereby generating a decoded pixel 
value signal; 

signal switch means for supplying the coded 
image signal to one of the first and second 
decoding means, according to switch control 
signals; 

said data analysis means supplying a first 
switch control signal to the signal switch means 
when the coded image signal is the coded 
binary signal, and supplying a second switch 
control signal to the signal switch means when 
the coded image signal is the coded arbitrary 
shape signal; and 

said signal switch means being in the fixed 
switching state where the coded shape bit 
streams of the coded binary signal are input to 
the first decoding means alone when it 
receives the first switch control signal, and 
being in the alternate switching state where, 
according to the arrangement of the coded 
shape bit streams and the coded pixel value bit 
streams in the coded arbitrary shape signal, 
the coded shape bit streams are input to the 
first decoding means while the coded pixel bit 
streams are input to the second deciding 
means, when it receives the second switch 
control signal. 

9. The image decoding apparatus of Claim 8 further 
comprising over-load detection means for detecting 
that the load on the decoding process by one of the 
first and second decoding means exceeds a thresh- 
old value which is set in advance, and outputting an 
over-load detection signal toward the data analysis 
means, wherein: 

when the over-load detection signal is input to 
the data analysis means, the data analysis 
means outputs a third switch control signal 
toward the signal switch means; and 
in response to the third switch control signal, 
the signal switch means supplies either of the 
coded shape bit streams and the coded pixel 
value bit streams in the coded arbitrary shape 
signal to one of the first and second decoding 
means. 

10. The image decoding apparatus of Claim 8 further 
comprising control signal input means for inputting 
a manual control signal from the outside to the data 
analysis means, wherein: 

when the data analysis means receives the 
manual control means, it outputs a third switch 
control signal toward the signal switch means; 
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and 

in response to the third switch control signal, 
the signal switch means supplies the coded 
shape bit streams of the coded arbitrary shape 
signal to the first decoding means, and does s 
not supply the coded pixel value bit streams to 
the second decoding means. 

11. A data structure for transmitting a coded image sig- 
nal obtained by coding a digital image signal, 
including: 

at least one of the following two kinds of coded 
bit streams: coded shape bit streams obtained 
by coding a shape signal which represents a 
display image of binary information or the 
shape in binary format of each object as one of 
the components of a display image; and coded 
pixel value bit streams obtained by coding a 
pixel value signal representing the gradation of 
the display image or the object; and 
an image identifier comprising a multiple-bit 
code, for deciding whether the coded image 
signal includes, as the coded bit streams, at 
least the coded shape bit streams, or only the 
coded pixel value bit streams; 
wherein the image identifier and the coded bit 
streams are arranged so that the image identi- 
fier is followed by the coded bit streams. 

12. An image decoding method receiving, as a coded 
signal obtained by coding a digital image signal, a 
coded image signal having an image identifier 
according to the data structure of the digital image 
signal, and subjecting the coded image signal to a 
decoding process according to the data structure, 
wherein: 

the coded image signal is analyzed with refer- 
ence to the image identifier to decide which of 
the following three coded signals is the coded 
image signal; 

a coded arbitrary shape signal including, 
as data for display, both of coded shape bit 
streams obtained by coding a shape signal 
which represents the shape of each object 
as one of the components of a display 
image, and coded pixel value bit streams 
obtained by coding a pixel value signal rep- 
resenting the gradation of the object, 
a coded pixel value signal including, as 
data for display, only coded pixel value bit 
streams obtained by coding a pixel value 
signal representing the gradation of a dis- 
play image, and 

a coded binary signal including, as data for 
display, only coded shape bit streams 



obtained by coding a shape signal repre- 
senting a display image of binary informa- 
tion; 

when the input coded image signal is the coded 
pixel value signal, decoding of the coded pixel 
value bit streams thereof is carried out; and 
when the input coded image signal is the coded 
binary signal or the coded arbitrary shape sig- 
nal, decoding of the coded bit streams included 
in these signals is stopped. 

13. An image decoding apparatus receiving, as a 
coded signal obtained by coding a digital image sig- 
nal, a coded image signal having an image identi- 
fier according to the data structure of the digital 
image signal, and subjecting the coded image sig- 
nal to a decoding process according to the data 
structure, comprising: 
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SO 

data analysis means for analyzing the coded 
image signal according to the image identifier 
to decide which of the following three coded 
signals is the coded image signal; 

25 

a coded arbitrary shape signal including, 
as data for display, both of coded shape bit 
streams obtained by coding a shape signal 
which represents the shape of each object 
30 as one of the components of a display 

image, and coded pixel value bit streams 
obtained by coding a pixel value signal rep- 
resenting the gradation of the object, 
a coded pixel value signal including, as 
35 data for display, only coded pixel value bit 

streams obtained by coding a pixel value 
signal representing the gradation of a dis- 
play image, and 

a coded binary signal including, as data for 
40 display, only coded shape bit streams 

obtained by coding a shape signal repre- 
senting a display image of binary informa- 
tion; 

45 decoding means for decoding the coded pixel 

value bit streams to generate a decoded pixel 
value signal; 

signal discarding means for discarding the 
coded image signal; 
so signal switch means for supplying the coded 

image signal to one of the decoding means and 
the signal discarding means according to a 
switch control signal; 

said data analysis means supplying a first 
55 switch control signal to the signal switch means 

when the coded image signal is the coded pixel 
value signal, and supplying a second switch 
control signal to the signal switch means when 
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the coded image signal is the coded arbitrary 
shape signal or the coded binary signal; and 
said signal switch means supplying the coded 
pixel value bit streams of the coded pixel value 
signal to the decoding means when it receives 5 
the first switch control signal, and supplying the 
coded bit streams included in the coded arbi- 
trary shape signal and the coded binary signal 
to the signal discarding means when it receives 
the second switch control signal. io 

14. A data structure for transmitting a coded image sig- 
nal obtained by coding a digital image signal, 
including: 

15 

at least one of the following three kinds of 
coded bit streams: coded shape bit streams 
obtained by coding a shape signal which repre- 
sents a display image of binary information or 
the shape in binary format of each object as 20 
one of the components of a display image; 
coded pixel value bit streams obtained by cod- 
ing a pixel value signal representing the grada- 
tion of the display image or the object; and 
coded transparency bit streams obtained by 25 
coding a multivalued transparency signal rep- 
resenting the transparency of the object; and 
an image identifier comprising a multiple-bit 
code, for deciding which of the following four 
coded image signals is the coded image signal; 30 

a coded pixel value signal including, as the 
coded bit streams, only the coded pixel 
value bit streams, 

a first coded arbitrary shape signal includ- 35 
ing, as the coded bit streams, the coded 
shape bit streams and the coded pixel 
value bit streams, 

a coded binary signal including, as the 
coded bit streams, only the coded shape «o 
bit streams, and 

a second coded arbitrary shape signal 
including, as the coded bit streams, the 
coded shape bit streams, the coded pixel 
value bit streams, and the coded transpar- 15 
ency bit streams; 

wherein the image identifier and the coded bit 
streams are arranged so that the image identi- 
fier is followed by the coded bit streams. so 

15. An image decoding method receiving, as a coded 
signal obtained by coding a digital image signal, a 
coded image signal having an image identifier 
according to the data structure of the digital image 55 
signal, and subjecting the coded image signal to a 
decoding process according to the data structure, 
wherein: 



the coded image signal is analyzed with refer- 
ence to the image identifier to decide which of 
the following four coded signals is the coded 
image signal; 

a first coded arbitrary shape signal includ- 
ing, as data for display, coded shape bit 
streams obtained by coding a shape signal 
representing the shape of each object as 
one of the components of a display image, 
and coded pixel value bit streams obtained 
by coding a pixel value signal representing 
the gradation of the object, 
a second coded arbitrary shape signal 
including, as data for display, in addition to 
coded shape bit streams and coded pixel 
value bit streams, coded transparency bit 
streams obtained by coding a multivalued 
transparency signal representing the 
transparency of the object, 
a coded pixel value signal including, as 
data for display, only coded pixel value bit 
streams obtained by coding a pixel value 
signal representing the gradation of a dis- 
play image, and 

a coded binary signal including, as data for 
display, only coded shape bit streams 
obtained by coding a shape signal repre- 
senting a display image of binary informa- 
tion; 

when the input coded image signal is the first 
coded arbitrary shape signal, decoding of the 
coded shape bit streams thereof by a first 
decoding method, and decoding of the coded 
pixel value bit streams thereof by a second 
decoding method, are carried out according to 
the arrangement of these bit streams; 
when the input coded image signal is the sec- 
ond coded arbitrary shape signal, decoding of 
the coded shape bit streams thereof by the first 
decoding method, decoding of the coded pixel 
value bit streams thereof by the second decod- 
ing method, and decoding of the coded trans- 
parency bit streams thereof by a third decoding 
method, are carried out according to the 
arrangement of these bit streams; 
when the input coded image signal is the coded 
pixel value signal, only decoding of the coded 
pixel value bit streams thereof by the second 
decoding method is carried out; and 
when the input coded image signal is the coded 
binary signals only decoding of the coded 
shape bit streams thereof by the first decoding 
method is carried out. 

1 6. A data structure for transmitting a coded image sig- 
nal obtained by coding a digital image signal, 
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including: 

at least one of the following three kinds of 
coded bit streams: coded shape bit streams 
obtained by coding a shape signal which repre- 5 
sents a display image of binary information or 
the shape in binary format of each object as 
one of the components of a display image; 
coded pixel value bit streams obtained by cod- 
ing a pixel value signal representing the grada- io 
tion of the display image or the object; and 
coded transparency bit streams obtained by 
coding a multivalued transparency signal rep- 
resenting the transparency of the object; 
an image identifier comprising a multiple-bit 15 
code, for deciding which of the following five 
coded image signals is the coded image signal; 

a coded pixel value signal including, as the 
coded bit streams, only the coded pixel 20 
value bit streams, 

a first coded arbitrary shape signal includ- 
ing, as the coded bit streams, the coded 
shape bit streams and the coded pixel 
value bit streams, 25 
a coded binary signal including, as the 
coded bit streams, only the coded shape 
bit streams. 

a second coded arbitrary shape signal 
including, as the coded bit streams, the 30 
coded shape bit streams, the coded pixel 
value bit streams, and the coded transpar- 
ency bit streams, and 
a coded transparency signal including, as 
the coded bit streams, the coded shape bit 35 
streams and the coded transparency bit 
streams; 

wherein the image identifier and the coded bit 
streams are arranged so that the image identi- 40 
tier is followed by the coded bit streams. 

17. An image decoding method receiving, as a coded 
signal obtained by coding a digital image signal, a 
coded image signal having an image identifier 45 
according to the data structure of the digital image 
signal, and subjecting the coded image signal to a 
decoding process according to the data structure, 
wherein: 

the coded image signal is analyzed with refer- 
ence to the image identifier to decide which of 
the following five coded signals is the coded 
image signal; 

55 

a first coded arbitrary shape signal includ- 
ing, as data for display, coded shape bit 
streams obtained by coding a shape signal 



showing the shape of each object as one 
of the components of a display image, and 
coded pixel value bit streams obtained by 
coding a pixel value signal representing 
the gradation of the object, 
a second coded arbitrary shape signal 
including, as data for display, in addition to 
coded shape bit streams and coded pixel 
value bit streams, coded transparency bit 
streams obtained by coding a multivalued 
transparency signal representing the 
transparency of the object, 
a coded transparency signal including, as 
data for display, coded shape bit streams 
obtained by coding a shape signal repre- 
senting the shape of the object, and coded 
transparency bit stream obtained by cod- 
ing a multivalued signal representing the 
transparency of the object, 
a coded binary signal including, as data for 
display, only coded shape bit streams 
obtained by coding a shape signal repre- 
senting a display image of binary informa- 
tion, and 

a coded pixel value signal including, as 
data for display, only coded pixel value bit 
streams obtained by coding a pixel value 
signal representing the gradation of a dis- 
play image; and 

when the input coded image signal is the first 
coded arbitrary shape signal, decoding of the 
coded shape bit streams thereof by a first 
decoding method, and decoding of the coded 
pixel value bit streams thereof by a second 
decoding method, are carried out according to 
the arrangement of these bit streams; 
when the input coded image signal is the sec- 
ond coded arbitrary shape signal, decoding of 
the coded shape bit streams thereof by the first 
decoding method, decoding of the coded pixel 
value bit streams thereof by the second decod- 
ing method, and decoding of the coded trans- 
parency bit streams thereof by a third decoding 
method, are carried out according to the 
arrangement of these bit streams; 
when the input coded image signal is the coded 
transparency signal, decoding of the coded 
shape bit streams thereof by the first decoding 
method, and decoding of the coded transpar- 
ency bit streams by the third decoding method, 
are carried out according to the arrangement of 
these bit streams; 

when the input coded image signal is the coded 
binary signal, only decoding of the coded 
shape bit streams thereof by the first decoding 
method is carried out; and 
when the input coded image signal is the coded 
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pixel value signal, only decoding of the coded 
pixel value bit streams thereof by the second 
decoding method is carried out. 

18. A data storage medium containing a program for s 
implementing a decoding process of a coded image 
signal by a computer, said program being con- 
structed so that a decoding process according to an 
image decoding method as defined in any of Claims 

6, 1 2, 1 5, and 1 7 is executed by the computer. w 

19. A data storage medium containing a coded image 
signal obtained by coding a digital image signal, 
said coded image signal having a data structure for 
image transmission as defined in any of Claims 1. rs 
11, 14, and 16. 
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